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INTRODUCTION 

Lung  cancer  continues  to  be  the  leading  cause  of  cancer-related  death  in  both  men  and  women 
in  the  United  States  (1).  The  majority  of  lung  cancers  are  non-small  cell  lung  cancers  (NSCLCs) 
that  include  squamous  cell  carcinomas  (SCCs)  and  adenocarcinomas  (2)  Lung  cancer  mortality 
is  high  in  part  because  most  cancers  are  diagnosed  after  regional  or  distant  spread  of  the 
disease  had  already  occurred  and  due  to  the  lack  of  reliable  biomarkers  for  early  detection  and 
risk  assessment  (2).  The  identification  of  new  effective  early  biomarkers  will  improve  clinical 
management  of  lung  cancer  and  is  linked  to  better  understanding  of  the  molecular  events 
associated  with  the  development  and  progression  of  the  disease. 

It  has  been  suggested  that  histologically  normal-appearing  tissue  adjacent  to  neoplastic  lesions 
display  molecular  abnormalities  some  of  which  are  in  common  with  those  in  the  tumors  (3). 
This  phenomenon,  termed  field  of  cancerization,  was  later  shown  to  be  evident  in  various 
epithelial  cell  malignancies,  including  lung  cancer  (4,  5).  Loss  of  heterozygosity  (LOH)  events 
are  frequent  in  cells  obtained  from  bronchial  brushings  of  normal  and  abnormal  lungs  from 
patients  undergoing  diagnostic  bronchoscopy  and  were  detected  in  cells  from  the  ipsilateral  and 
contralateral  lungs  (3).  More  recently,  global  mRNA  expression  profiles  have  been  described  in 
the  normal-appearing  bronchial  epithelium  of  healthy  smokers  (7).  In  addition,  modulation  of 
global  gene  expression  in  the  normal  epithelium  in  health  smokers  is  similar  in  the  large  and 
small  airways  and  the  smoking-induced  alterations  are  mirrored  in  the  epithelia  of  the  mainstem 
bronchus,  buccal  and  nasal  cavities  (8). 

In  this  program,  in  Specific  Aim  1,  high-throughput  microarray  mRNA  expression  analyses  are 
being  performed  on  cytological  specimens  (brushings)  obtained  at  intraoperative  bronchoscopy 
from  the  main  carina  and  main  ipsilateral  bronchus,  as  well  as  on  specimens  obtained  at 
lobectomy  procedures  from  the  main  lobe  bronchus  (adjacent  to  SCCs),  sub-segmental 
bronchus  (adjacent  to  adenocarcinomas)  and  from  the  resected  NSCLC  tumors.  Towards  this 
aim,  we  are  comparing  and  contrasting  global  gene  expression  patterns  across  all  the 
specimens  from  the  entire  field  and  corresponding  NSCLC  tumors.  We  are  deriving  lung 
adenocarcinoma  and  SCC  field  cancerization  signatures  signifying  the  differential  mRNA 
expression  patterns  between  the  carina  and  the  subsegmental  bronchus  and  main  lobe 
bronchus,  respectively.  In  addition,  similar  expression  profiles  between  the  carina  and  resected 
NSCLC  tumors  will  be  integrated  with  available  gene  expression  data  of  bronchial  brushings 
from  the  main  carina  isolated  at  various  time  points  post-surgery  from  40  NSCLC  patients; 
Department  of  Defense  (DoD)  VITAL  patients.  Promising  markers  derived  from  this  study  are 
being  validated  at  the  mRNA  and  protein  level  in  histological  tissue  specimens.  Moreover,  and 
in  collaboration  with  the  Initiating  PI  (Dr.  Steve  Dubinett)  and  other  Partnering  Pis  (Dr.  Avrum 
Spira  and  Dr.  Pierre  Massion)  we  are  currently  performing  RNA-sequencing  and  microarray 
profiling  of  nasal  epithelia,  airway  epithelial  cells  collected  from  both  bronchoscopy  and 
lobectomy  specimens  as  well  as  of  corresponding  tumors  (NSCLC  patients)  or  benign  lesions 
(cancer-free  individuals).  In  future  studies,  in  Specific  Aim  3,  we  will  use  expression  signatures 
and  biomarkers  derived  from  this  aim  of  the  study  to  develop  and  test  airway-based  biomarkers 
capable  of  diagnosing  lung  cancer  in  current  or  former  smokers  using  minimally  invasive  sites. 

This  report  details  the  progress  made  during  the  second  year  of  research.  This  report  will  also 
be  included  in  the  comprehensive  annual  report  to  be  prepared  and  submitted  by  Dr.  Steven 
Dubinett  (UCLA)  as  the  Initiating  PI. 
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MD  Anderson  Cancer  Center  (Dr.  I  Wistuba)  PROGRESS  REPORT 

Molecular  Profiles  for  Lung  Cancer  Pathogenesis  and  Detection  in  U.S.  Veterans 

Specific  Aim  1 :  To  increase  our  understanding  of  the  molecular  basis  of  the 

pathogenesis  of  lung  cancer  in  the  “field  cancerization”  that 
develops  in  current  and  former  smokers. 

Summary  of  Research  Findings 

A.  Gene  and  protein  expression  analysis  of  bronchial  epithelial  samples  obtained  from 
bronchoscopy  from  NSCLC  patients  (Sub-specific  Aim  1A,  1C  and  ID):  This  analysis  was 
performed  on  samples  obtained  from  patients  enrolled  to  the  Vanguard  clinical  trial  supported 
by  the  DoD  VITAL  grant  (W81XWH-04-1-0412,  PI  Dr.  W.K.  Hong),  and  the  gene  profiling  work 
and  analysis,  and  the  protein  validation  study,  was  partially  supported  by  the  Lung  Cancer 
Consortium  grant  reported  here  (W81XWH-1 0-1 -1007). 

Gene  Expression  Analysis:  Patients  on  the  prospective  Vanguard  study  had  definitively 
treated  ES  (I/ll)  NSCLC  and  were  current  or  former  smokers.  Patients  had  bronchoscopies  with 

brushings  obtained  from  the  main  carina  (MC), 
airways  adjacent  (ADJ)  to  the  previously 
resected  tumor  and  from  airways  distant  from 
the  tumor  in  the  ipsilateral  (NON-ADJ)  and 
contralateral  (CONTRA)  lung  at  baseline,  12, 
24  and  36  months  following  resective  surgery 
(Figure  1).  Nineteen  patients  were  selected  for 
the  study  based  on  airway  sampling  of  at  least 
five  different  regions  per  time  point  and 
continuously  up  to  24  or  36  months  (391  airway 
samples  from  nineteen  patients).  Data  from  the 
gene  expression  analysis  have  been  described 
and  detailed  in  the  previous  annual  report 
(Year  1).  Microarray  analysis  identified  gene  features  (n=1 1 65)  that  were  non-uniform  by  site 
and  differentially  expressed  between  airways  adjacent  to  tumors  relative  to  more  distant 
samples  as  well  as  those  (n=1 395)  that  were  significantly  altered  with  time  up  to  three  years.  In 
addition,  gene-interaction  networks  mediated  by  PI3K  and  ERK1/2  were  modulated  in  adjacent 
compared  to  contralateral  airways  and  the  latter  network  with  time.  Key  markers  of  the  identified 
gene-interaction  networks  were  validated  at  the  protein  level  by  immunohistochemical  analysis. 
The  study  highlighted  spatial  and  temporal  cancer-associated  expression  alterations  in  the 
molecular  field  of  injury  of  early  stage  NSCLC  patients  after  definitive  surgery  that  warrant 
further  validation  in  independent  studies.  These  findings  were  submitted  and  have  been 
accepted  for  publication  and  are  currently  in  press  (see  Reportable  Outcomes  and  Appendix). 

Protein  Expression  Analysis:  Our  findings  on  the  modulation  of  PI3K-  and  ERK1/2-mediated 
networks  by  site  and  time  after  surgery  prompted  us  to  examine  the  immunohistochemical  (IHC) 
expression  of  surrogate  markers  of  both  signaling  cascades  in  corresponding  formalin-fixed 
paraffin  embedded  (FFPE)  airway  biopsies.  We  sought  to  examine  expression  of  AKT 
phosphorylated  at  Threonine308  since  phosphorylation  of  this  amino  acid  is  well  known  to  occur 
through  phosphoinositide-dependent  kinase  1  (PDPK1)  following  PI3K  activation.  We  assessed 


Analysis  of  SITE-dependent 
gene  expression  patterns  (ADJ, 
NON-ADJ,  MC,  CONTRA) 

^  Global  gene  expression 
™  analysis 

Analysis  of  TIME-dependent 
gene  expression  patterns 
(baseline,  12,  24  and  36 
months. 

Figure  1.  Schematic  illustrating  sampling  and  collection  of 
bronchial  brushes  from  different  airway  sites  at  different  time 
points  following  resective  surgery  from  early  stage  smoker 
NSCLC  patients  enrolled  in  the  phase  II  cancer  surveillance 
Vanguard  study. 
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by  IHC  the  immunohistochemical  expression  of  phospho-AKT(Thr308)  and  phospho- 
ERK1/2(Thr202/Tyr204)  in  available  and  evaluable  histologically  normal  bronchial  epithelia 
biopsies  (n=324)  corresponding  to  the  brushings  analyzed  by  expression  profiling.  The  intensity 
and  extent  of  cytoplasmic  and  nuclear  pAKT-Thr308  immunostaining  was  evaluated  using  a 
light  microscope  (magnification,  x20).  Cytoplasmic  expression  was  quantified  using  a  four-value 
intensity  score  (0,  none;  1,  weak;  2,  moderate  and  3,  strong)  and  the  percentage  (0-100%)  of 
the  extent  of  reactivity).  A  final  cytoplasmic  expression  score  for  pAKT-Thr308  was  obtained  by 
multiplying  the  intensity  and  reactivity  extension  values  (range,  0-300).  Nuclear  expression  was 
quantified  using  the  percentage  of  extent  of  reactivity  which  gave  rise  to  a  nuclear  expression 
score  for  pAKT-Thr308  (range,  0-100). 

Immunoreactivity  of  phospho-AKT  (min  0,  max  300)  was  variable  as  depicted  in  the 
representative  photomicrographs  in  Figure  2A  and  was  detected  in  both  the  cytoplasm  and 
nucleus  of  normal  bronchial  epithelia  (NBE)  (Figure  2A).  IHC  analysis  demonstrated  that 
cytoplasmic  (p<0.0001)  and  nuclear  (p=0.01)  phospho-AKT  statistically  significantly  increased 
with  time  up  to  three  years  in  all 
NBE  (Figure  2B)  with  highest 
expression  at  the  36  month  time 
point.  Nuclear  phosphorylated  AKT 
was  also  statistically  significantly 
increased  in  adjacent  NBE 
compared  to  airways  relatively 
more  distant  from  tumors  in  the 
mixed-effects  model  (p=0.04) 

(Figure  2B).  Immunoreactivity  of 
phospho-ERK1/2  was  also  variable 
(min  0,  max  209)  and  mainly 
cytoplasmic  (Figure  2C).  IHC 
analysis  demonstrated  that 
phosphorylated  ERK1/2  was 
statistically  significantly  elevated  in 
adjacent  NBE  (p=0.03)  and 
significantly  increased  with  time  up 
to  three  years  in  all  airways  when 
averaged  together  (p=0.02) 

(Figure  2D)  in  the  model.  Notably, 
there  was  a  significant  interaction 
(p=0.019)  between  site  of  NBE  and 
time  of  sampling  as  phospho- 
ERK1/2  expression  was 
significantly  increased  in  adjacent 
NBE  but  not  in  contralateral 
airways  and  main  carinas  in  the  model  (Figure  2D)  with  highest  expression  in  adjacent  and 
non-adjacent  (ipsilateral,  green  plot)  airways.  These  data  demonstrated  that  like  differential 
gene  expression  profiles  within  the  lung  airway  field  of  injury,  canonical  activated  oncogenes  are 
modulated  by  site  from  the  resected  tumor  and  time  following  definitive  surgery  in  early  stage 
NSCLC  patients  and  have  been  also  accepted  for  publication  along  with  the  gene  expression 
data  above  (Reportable  Outcomes  and  Appendix). 


p-AKT(Th  r308) 


Cytoplasmic  p-AKT(Thr308) 


—ADJ 

100' 

—  NON-ADJ 

200' 

—  MC 

m 

—  CONTRA  [ 

&150' 

a 

yioo. 

40' 

Effect  pv.ikie 

50' 

Sit*  0.241 

20' 

Time  <.0001 

O' 

Site  Time  0.6)8 

O' 

Nuclear  p-AKTCmr308) 


p-ERK1/2(Thr202ytyr2Q4) 


12  24 

Time 


p-ERK1/2(Thr202/lyr204) 
■=aTu - 


Effect 

pv.ikie 

—  ADJ 

Site 

0.041 

—  NON-ADJ 

Time 

0.012 

Site  Tune 

MSI 

0 

12 

24  36 

Time 

Figure  2.  A.  Representative  photomicrographs  (20x  magnification)  depicting 
strong  (top)  and  weak  (bottom)  phospho-AKT(Thr308)  immunostaining.  B. 
Immunohistochemical  scores  of  cytoplasmic  (left)  and  nuclear  phospho-AKT 
(middle)  were  assessed  for  statistically  significant  differences  by  site  and  time 
in  a  mixed-effect  model  and  plotted  in  main  carinas  (MC)  and  in  adjacent 
(ADJ),  non-adjacent  (ipsilateral,  NON-ADJ)  and  contralateral  (CONTRA) 
airways  with  time.  C.  Representative  photomicrographs  (20x  magnification) 
depicting  strong  (top)  and  weak  (bottom)  phospho-ERK1/2(Thr202/Tyr204) 
immunostaining.  D.  Immunohistochemical  scores  of  phosphorylated  ERK1/2 
levels  were  assessed  for  statistically  significant  differences  by  site  and  time  in 
a  mixed-effect  model  and  repeated  measures  analysis,  site'time,  term  for 
interaction  between  site  and  time  factors.  Error  bars  represent  S.E.  of  the 
mean. 
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B.  Gene  expression  analysis  of  bronchial  epithelial  samples  obtained  from  lobectomy 
specimens  from  NSCLC  patients  ( Field  Cancerization  Study)  (Sub-specific  Aim  1A  and 
1C): 

Gene  Expression  Analysis:  To  increase  our 
understanding  of  the  molecular  basis  of  lung  cancer 
pathogenesis,  we  analyzed  the  transcriptome  profiles 
of  cytological  specimens  (brushings)  obtained  at 
lobectomy  or  pneumonectomy  procedures  from  2-5 
bronchioles  with  differential  proximity  from  resected 
tumors  and  from  resected  lung  tumors  and  normal 
parenchyma  (Figure  3).  Samples  were  obtained  from 
patients  who  did  not  receive  neoadjuvant  therapy 
undergoing  lobectomy  or  pneumonectomy  procedures 
(n=23)  under  an  MD  Anderson  institutional  review 
board  (IRB)-approved  protocol.  A  summary  of  the 
clinicopathological  characteristics  of  the  studied  patient 
cases  has  been  described  in  the  previous  annual 
report  (Year  1).  194  samples  were  obtained  from 

surgically  resected  specimens  of  early-stage  (l-IIIA) 

NSCLC  patients  (n=20  cases).  NSCLCs  and  paired 
normal  lung  were  obtained  frozen,  preserved  in 
RNA later  or  by  surface-brushing.  Normal  bronchial 
epithelia  (NBE)  were  collected  by  brushing  bronchial 
structures  using  Cytosoft  cytology  brushes  (Cardinal 
Health).  Histological  assessment  of  NBE  by 
hematoxylin  &  eosin  (H&E)  and  pan-cytokeratin  IHC 
analysis  confirmed  the  lack  of  preneoplastic  and 
neoplastic  cells  and  revealed  greater  than  90%  epithelial  content,  respectively  (Figure  4).  Total 
RNA  of  all  samples  was  isolated  using  the  miRNeasy  Mini  kit  following  homogenization  of 

tissues  and  brushing  collections.  RNA  quality  was 
assessed  by  the  28S/18S  ribosomal  RNA  ratio  and  by 
RNA  integrity  number  (RIN)  using  the  Agilent 
Bioanalyzer.  Processed  RNA  samples  were  hybridized 
to  Affymetrix  GeneChip®  Human  Gene  1.0  ST  arrays 
and  were  then  scanned  using  the  GeneChip®  Scanner 
3000  from  Affymetrix  (Santa  Clara,  CA)  to  yield  raw 
image  files  that  were  subsequently  converted  to  probe 
set  data.  Raw  data  were  normalized  and  processed 
using  the  robust  multi-chip  algorithm  (RMA)  algorithm. 
We  then  applied  linear  mixed-effects  models  and 
ANOVA-Tukey  tests  to  identify  differentially  expressed 
genes  among  groups  based  on  feature-by-featured  t- 
test  p-values  and  false  discovery  rates  (FDRs). 


NSCLC 


Normal  lung 


Airway 


Figure  4.  Representative  H&t  analysis  (A)  of  an 


NSCLC  (left),  paired  normal  lung  tissue  (middle) 
and  airway  (right).  B.  Representative  cytokeratin 
staining  of  an  airway/bronchial  structure  showing 
positive  immunoreactivity. 


Figure  3.  Schematic  illustration  of  the  lung 
airway  for  molecular  mapping  of  field 
cancerization  by  high-throughput  profiling. 

Tumor  (T)  and  normal  parenchyma  (NP) 
samples  were  obtained  by  brushing, 
preservation  in  RNA/afer  and  snap-freezing  in 
liquid  nitrogen.  Airway  samples  (B1  to  B5)  were 
obtained  by  brushing. 


We  then  sought  to  derive  gene  expression  signatures  signifying  genes  significantly  and 
concurrently  differentially  expressed  between  both  tumors  and  airways  compared  to  normal  lung 
tissue  (field  cancerization  signatures).  We  first  analyzed  tumors,  airways  and  normal  lung 
pertaining  to  NSCLC  cases  (n=20)  excluding  the  three  carcinoid  cases.  Following  linear-mixed 
effects  model  coupled  with  ANOVA-Tukey  analysis,  we  identified  457  and  1204  genes  to  be  up- 
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regulated  and  down-regulated,  respectively,  between 
both  NSCLCs  and  airways  compared  to  normal  lung 
tissue  based  on  a  statistical  threshold  of  a  p- 
value<0.01  as  well  as  a  1.5  fold-change  in  expression. 

Hierarchical  clustering  analysis  demonstrated  that 
these  genes  were  able  to  separate  NSCLC  tumor 
samples  and  airway  samples  from  normal  lung  tissues, 
the  latter  being  segregated  in  a  separate  cluster 
(Figure  5). 

We  then  sought  it  worthwhile  to  derive  such  field 
cancerization  signatures  when  analyzing  squamous 
cell  carcinoma  and  lung  adenocarcinoma  cases  (the 
two  major  subtypes  of  NSCLC)  separately.  Similar 

analysis 
was 

performed 
using  the 
same 
selection 

criteria  mentioned  above.  We  derived  field 
cancerization  signatures  denoting  genes  significantly 
and  concordantly  differentially  expressed  between 
both  lung  tumors  and  airways  compared  to  paired 
uninvolved  normal  lung  tissue  for  both  lung  SCC 
(n=5)  and  adenocarcinoma  (n=14)  cases  that  were 
comprised  of  1,803  and  1,735  genes,  respectively 
(Figures  6A  and  6B). 

We  then  sought  to  functionally  analyze  the  molecular 
field  cancerization  in  NSCLC.  We  performed 
pathways  and  gene-interaction  network  analysis  of 
the  1,661  differentially  expressed  genes  described  in  Figure  5  using  Ingenuity  Pathways 
Analysis  (Redwood,  CA).  The  top  functional  pathway  and  gene-interaction  network  are 

represented  in 
Figure  7A  and 
Figure  7B, 
respectively. 
Certain 
molecules  within 
the  top 

functional 
pathway  {MET, 
TGFBR2 )  and 
gene-interaction 
network 
{GPRC5A, 
VIPR1)  are 
highlighted  in 
bold  as  they 
have  been 
selected  for 

urther  confirmation  and  validation  of  expression  in  subsequent  studies  (see  below).  It  is 


Figure  7.  Functional  pathways  analysis  of  NSCLC  molecular  field  cancerization.  Functional 
pathway  and  gene-interaction  network  analysis  of  the  NSCLC  field  cancerization  signature  derived  in 
Figure  5.  The  top  functional  pathway  and  gene-interaction  network  are  presented  in  A  and  B, 
respectively.  Molecules  selected  for  further  expression  validation  are  in  bold,  red,  up-regulated:  green, 
down-regulated). 


Normal  lung 
Airway 

cases 

Tumors 


Figure  6.  Field  cancerization  expression  profiles 
in  early-stage  lung  SCCs  and  adenocarcinomas. 

Hierarchical  clustering  of  genes  significantly 
differentially  expressed  between  both  tumors  and 
airways  compared  to  normal  lung  in  SCC  (A, 
n=1,803  genes)  and  adenocarcinoma  (ADC)  (B, 
n=1,735  genes)  cases.  Columns  represent  samples 
and  rows  represent  gene  features  (red,  up-regulated; 
green,  down-regulated). 


Figure  5.  Field  cancerization  expression 
profiles  in  early-stage  NSCLC.  Hierarchical 
clustering  of  genes  significantly  differentially 
expressed  between  both  tumors  and  airways 
compared  to  normal  lung  in  NSCLC  (n=1 ,661 ). 
Columns  represent  samples  (n=1 94  samples  from 
20  cases)  and  rows  represent  gene  features  (red, 
up-regulated;  green,  down-regulated). 
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noteworthy  that  functional  analysis  revealed  that  the  identified  molecular  field  cancerization  in 
NSCLC  was  comprised  of  typical  cancer-associated  pathways  and  networks  such  as  embryonic 
stem  cell,  eicosanoid,  retinoic  acid  receptor  (RAR),  NANOG  stem  cell,  NF-kB  and  PTEN 
signaling  pathways  and  gene  networks.  These  findings  highlight  expression  patterns  and 
pathways  that  are  typically  deregulated  in  overt  tumors  but  also  prevalent  in  histologically 
normal  airway  epithelia,  and  therefore,  may  play  important  roles  in  the  pathogenesis  of  lung 
adenocarcinomas  and  SCCs. 

The  above  findings  highlight  expression  patterns  and  pathways  in  the  molecular  field 
cancerization  of  NSCLC  that  may  offer  both  detection  markers  as  well  as  therapeutic  targets  to 
guide  personalized  chemoprevention  in  early  stage  NSCLC  patients.  To  explore  this  concept, 
we  deemed  it  worthwhile  to  assess  the  relevance  of  the  localized  field  cancerization  expression 
profiles  identified  to  those  of  large  airways  in  lung  cancer  patients  and  high  risk  smokers.  In 
collaboration  with  Dr.  Avrum  Spira’s  group  (Partnering  PI,  Boston  University),  we  analyzed 

the  field  cancerization 
profiles  in  the  publicly 
available  and 

published  cohort  by 
Spira  et  al  (9) 
comprised  of  129 
proximal  airway 
samples.  The  gene 
expression  raw  data 
of  airway  epithelium  of 
smokers  with  suspect 
lung  cancer  were  re¬ 
normalized  to  produce 
gene-level  expression 
values  using  RMA 
{10)  in  the  affy 
package  {11)  of  the 
Bioconductor  software 
suite  (version  2.4.1) 
{12)  and  Entrez  gene- 
specific  probe  set 
mapping  from 

BrainArray  (version 

11.0.1)  {13).  A  linear  model  was  then  used  to  adjust  the  expression  estimates  for  the  mean  z- 
score  quality  metric  previously  described  in  the  original  report.  We  then  performed  Gene  Set 
Enrichment  Analysis  (GSEA)  {14)  and  a  list  of  all  Entrez  gene  identifiers  within  the  field 
cancerization  signature  was  ranked  according  to  the  Student  t  statistic  computed  between 
proximal  airways  from  cancer  and  non-cancer  patients  from  the  original  report  by  Spira  et  al. 
This  list  was  then  used  to  perform  a  pre-ranked  GSEA  analysis  to  identify  which  of  the  field 
cancerization  gene  sets  were  significantly  enriched  among  genes  that  were  up-  or  down- 
regulated  in  the  normal  bronchial  epithelium  of  smokers  with  suspect  lung  cancer.  A  leading 
edge  gene  set  was  then  derived  that  was  comprised  of  299  genes  that  were  concordantly 
modulated  between  NSCLCs  and  airways  compared  to  normal  lung  in  the  localized  field 
cancerization  as  well  as  between  proximal  large  airways  of  cancer  patients  compared  to  non¬ 
cancer  patients  (Figure  8).  Side  by  side  hierarchical  cluster  analysis  using  the  leading  edge 
genes  of  the  localized  field  cancerization  samples  (left  panel)  and  of  the  large  airways  of  cancer 
and  non-cancer  patients  (right  panel)  revealed  concordance  in  differential  expression  among 
the  sub  groups  (Figure  8).  These  data  suggest  that  expression  profiles  from  the  localized  field 


Figure  8.  Analysis  of  the  localized  field  cancerization  In  uninvolved  mainstem  bronchi  of 
patients  with  suspect  lung  cancer.  GSEA  analysis  was  performed  to  identify  genes  (n=299) 
differentially  expressed  between  tumors  and  airways  compared  to  normal  lung  in  the  localized 
field  cancerization  (FC)  of  NSCLCs  that  are  also  significantly  and  concordantly  differentially 
expressed  between  mainstem  bronchi  of  cancer  and  non-cancer  patients.  Hierarchical  cluster 
analysis  using  the  299  leading  edge  genes  was  performed  side  by  side  in  the  localized  FC  (left 
panel)  and  mainstem  bronchi  of  cancer  (green)  and  non-cancer  (grey)  patients  (right  panel). 
Columns  represent  samples  and  rows  represent  gene  features  (red,  up-regulated;  blue,  down- 
regulated). 
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cancerization  of  resected  NSCLC  specimens  may,  in  part,  be  relevant  to  differential  field  of 
injury  molecular  profiles  between  cancer  and  non-cancer  patients  and  may  comprise  potential 
targets  that  can  be  used  for  both  detection  and  chemoprevention  strategies. 

We  then  sought  to  further  study 
the  transcriptomic  architecture  of 
the  localized  field  cancerization  to 
determine  whether  the  molecular 
field  cancerization  exhibits  site  or 
distance  from  tumor-dependent 
expression  patterns.  Ordinal 
logistic  analysis  of  airway  samples 
alone  identified  genes  significantly 
progressively  and  differentially 
expressed  by  distance  from  the 
tumors  (n=422)  (Figure  9A). 

Functional  pathways  analysis  of 
the  genes,  using  Ingenuity 
Pathways  Analysis  (Redwood,  CA) 
demonstrated  that  canonical 
cancer-associated  pathways,  such  as  eukaryotic  initiation  factor,  p70S6K  kinase,  polo-like 
kinase  (PLK),  gluconeogenesis  and  mammalian  target  of  rapamycin  (mTOR)  signaling,  were 
significantly  and  progressively  modulated  in  the  airway  epithelia  by  distance  from  the  tumor 
(increased  with  shorter  distance  from  tumors)  (Figure  9B). 

We  then  derived  a  bioinformatic  index  score, 
signifying  the  extent  of  the  site-dependent 
modulation  of  the  genes  in  the  airway 
cancerization,  by  computing  the  difference  of 
expression  values  of  the  two  clusters  exhibited 
in  the  heatmap  in  Figure  9A  across  samples. 
The  score/index  quantitatively  assessed  the 
site-dependent  effect  and  was  increased  with 
shorter  distance  of  the  airways  from  the  tumors 
(Figure  10A).  Notably,  the  extent  of  the  site- 
dependent  molecular  field  cancerization  effect 
was  more  pronounced  in  lung  SCC  cases  and 
increased  at  a  much  larger  rate  with  shorter 
distance  from  the  tumors  in  this  NSCLC 
subtype  compared  to  adenocarcinomas 
(Figures  10B  and  10C)  which  may  be  related 
in  part  to  the  proximal  location  of  the  sampled 
airways  to  SCCs  relative  to  adenocarcinomas. 
However,  it  is  noteworthy  that  we  did  not  find 
significant  differences  in  site-dependent 
expression  of  members  of  the  surfactant  family 
such  as  SFTPB  and  S FTPC. 

Since  we  performed  the  regression  analysis  on  airways  alone,  we  then  deemed  it  worthy  to 
determine  whether  the  identified  significant  site-dependent  gene  expression  modulation  in  the 
airways  follow  the  same  pattern  between  matched  tumors  and  uninvolved  normal  lung  tissue. 
Importantly,  the  site-dependent  genes  signified  by  the  cancerization  index  score  were 
collectively  significantly  and  concordantly  modulated  between  tumors  and  paired  uninvolved 
normal  lung  tissues  (Figure  10D).  It  is  important  to  note  that,  although  the  site-dependent 


< d  8 


Airways 


molecular  airway  field 
cancerization  is  more  pronounced  in  lung  SCCs  relative 
to  adenocarcinomas  and  informs  of  corresponding 
tumor  profiles.  Boxplots  depicting  site-dependent  molecular 
airway  field  cancerization  modulation  in  early-stage  NSCLCs 
(A),  adenocarcinomas  (B),  lung  SCCs  (C)  and  between 
paired  NSCLCs  and  normal  lung  tissue  (D).  Airway  distance 
from  tumor  is  numerically  indicated  with  a  range  of  1 
(closest)  to  5  (farthest). 
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Figure  9.  Site-dependent  modulation  of  the  molecular  field  cancerization 
in  early-stage  NSCLC.  A.  Ordinal  logistic  regression  analysis  of  airways 
identifying  422  genes  modulated  in  expression  by  distance  from  the  tumor. 
Columns  represent  samples  and  rows  represent  gene  features  (red,  up- 
regulated;  green,  down-regulated).  B.  Functional  pathways  analysis  of  the 
genes  using  Ingenuity  Pathways  Analysis  software. 
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molecular  field  cancerization  expression  profiles  were  concordantly  modulated  between 
corresponding  NSCLCs  and  matched  normal  lung  tissue,  they  did  not  significantly  correlate  with 
expression  profiles  of  the  proximal  large  airways  (mainstem  bronchi)  of  patients  with  suspect 
lung  cancer.  GSEA  analysis  of  the  aforementioned  422  genes  (clustered  in  Figure  9)  did  not 
reveal  significant  enrichment  of  this  expression  signature  in  the  published  dataset  by  Spira  et  al 
(9).  These  findings  suggest  that  unlike  global  localized  field  cancerization  signatures  we 
described  above,  site  (from  tumor)-dependent  expression  differences  in  the  localized  molecular 
field  cancerization  may  not  be  relevant  to  the  extended  airway  field  of  injury.  Nonetheless,  it  is 
plausible  to  assume  that  the  site-dependent  localized  molecular  field  cancerization  profiles 
comprise  expression  profiles  with  typical  cancer-associated  signaling  pathways  and  inform  of 
tumor  expression  patterns  and,  thus,  of  potential  early  mechanisms  of  lung  cancer  pathogenesis 
commencing  in  normal  airway  epithelia. 


Expression  validation 

studies:  We  then 

sought  to  validate,  in 
part,  expression  of  field 
cancerization  markers. 

We  selected  field 

cancerization  markers 
that  displayed  site- 
dependent  or 
independent  differential 
expression  following 
microarray  profiling 
analysis  of  the 

localized  field 

cancerization  based  on 
both  statistical  criteria 

and  functional 

pathways  analysis.  Out 
of  the  NSCLC  cases 
studies,  18  comprised 
samples  with  enough 
RNA  left  after  array 
profiling  for  performing 
quantitative  real-time 
polymerase  chain 
reaction  (QRTPCR) 
analysis.  Microarray 
profiling  identified 
lysosome  associated 
protein  transmembrane 
4  beta  (LAPTM4B)  and 
neuropilin  (NRP)  and 
tolloid  (TLL)-like  2 
(NET02)  to  exhibit 
site-dependent 
differential  expression 
(increasing  in  airways 

closer  to  corresponding  tumors)  in  the  localized  field  cancerization.  Both  genes  were  not 
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Figure  11.  QRTPCR  analysis  of  airway  field  cancerization  markers.  Expression  of  LAPTM4B  (A), 
NET02  (B),  TGFBR2  (C)  and  VIPR1  (D)  was  analyzed  by  QRTPCR  in  18  out  of  20  cases  with  sufficient 
RNA  from  samples.  QRTPCR  analysis  was  performed  in  a  site-dependent  manner  (left  panels)  and  in  a 
site-independent  manner  comparing  tumors,  airways  and  normal  lung  tissues  (middle  panels).  Correlation 
between  expression  quantified  by  microarray  and  QRTPCR  analyses  was  statistically  assessed  by 
Spearman  rank  (right  panels). 
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identified  in  the  global  field  cancerization  signatures  derived  (Figures  5  and  6)  when  all 
samples  were  analyzed  independent  of  site.  Transforming  growth  factor  receptor  beta  II 
(TGFBR2)  was  found  to  be  decreased  in  the  molecular  field  cancerization  in  a  site-independent 
manner  and  vasoactive  intestinal  peptide  receptor  1  (VIPR1)  exhibited  both  site-dependent  and 
independent  decreased  expression  in  the  molecular  field  cancerization.  We  performed 
QRTPCR  analysis  of  LAPTM4B,  NET02,  TGFBR2  and  VIPR1  on  all  evaluable  cases  (Figures 
11  A-D).  QRTPCR  analysis  confirmed  the  microarray  findings  of  the  differential  expression 
pattern  of  all  four  genes  in  the  molecular  field  cancerization  (Figures  11  A-D).  LAPTM4B  and 
NET02  were  significantly  increased  in  airways  closer  to  tumors  compared  to  more  distant 
normal  epithelia  (p<0.05)  (Figures  11A-B,  left  panels).  Similar  to  the  array  findings,  LAPTM4B 
and  NET02,  although  increased  in  tumors  compared  to  normal  lung  tissue,  exhibited  similar 
levels  of  expression  among  airways,  when  averaged  all  together,  and  normal  lung  (Figures 
11A-B,  middle  panels).  Moreover  and  closely  similar  to  the  microarray  data,  TGFBR2  and 
VIPR1  mRNA  expression  were  decreased  in  tumors  and  airways  compared  to  normal  lung 
tissue  (Figures  11C-D,  middle  panels).  Whereas  TGFBR2  did  not  exhibit  site-dependent 
differential  expression  patterns  (Figure  11C,  left  panel),  VIPR1  was  progressively  decreased  in 
airways  with  closer  distance  from  corresponding  tumors  (Figure  1 1 D,  left  panel).  Importantly, 
we  found  significant  and  high  correlation  between  expression  of  the  four  genes  analyzed  by 

microarray  and  expression 
quantified  by  QRTPCR  (all 
p<0.0001)  (Figures  11  A-D, 
right  panels).  We  next 
performed  western  bot  analysis 
of  the  20  NSCLC  field 
cancerization  case  tumors  and 
corresponding  uninvolved 

normal  lung  tissues  for 
assessing  expression  of  the 
protein  products  coding  for 
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Figure  12.  Western  blotting  analysis  of  field  cancerization  markers  in 
NSCLCs  and  corresponding  normal  lung  tissues.  NSCLC  and  matched 
normal  lung  tissues  (20  field  cancerization  case  pairs)  were  lysed  and 
homogenized  after  which  20  pg  of  the  total  protein  homogenates  were  subjected 
to  SDS-PAGE.  Membranes  were  incubated  with  primary  antibodies  raised 
against  NET02  and  TGFBR2.  Membranes  were  also  incubated  with  antibody 
against  proliferating  cell  nuclear  antigen  (PCNA)  to  confirm  tumor  and  normal 
samples  and  with  (5-actin  to  ensure  equal  loading. 


NET02  and  TGFBR2.  Western  blot  analysis  confirmed  the  tumor  versus  normal  differential 
expression  of  the  field  cancerization  markers  at  the  protein  level  (Figure  12).  It  is  noteworthy 

that  we  were  not  successful 
to  assess  the  protein 
expression  of  LAPTM4B  as 
the  limited  antibodies 
available  for  this  recently 
identified  gene  did  not  work 
well  in  our  hands. 
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Figure  13.  Validation  of  increased  expression  of  the  MET  oncogene  in  the  field 
cancerization.  Quantitative  real-time  PCR  analysis  of  MET  mRNA  expression  in 
adenocarcinomas  and  matched  airways  and  normal  lung  (A  and  B).  C.  Representative 
MET  immunohistochemical  expression  in  a  case  comprised  of  a  tumor,  uninvolved 
normal  lung  and  five  bronchial  structures.  D.  Correlation  plot  of  MET  mRNA  and  protein 
expression  in  the  same  cases  by  Pearson  correlation. 


We  also  confirmed 
expression  of  additional  field 
cancerization  markers. 
Microarray  analysis 

revealed  that  the  MET 
oncogene  was  increased  in 
the  molecular  field 

cancerization,  and  did  not 
exhibit  site-dependent 
differential  expression 
features.  We  analyzed  MET 
mRNA  and  protein 
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expression  by  QRTPCR  and  immunohistochemical  analysis,  respectively,  in  an  independent  set 
of  cases  and  samples  we  had  not  profiled  (n=6  cases).  QRTPCR  validated  the  microarray 
findings  in  the  independent  set  and  demonstrated  that  MET mRNA  expression  was  significantly 
higher  in  both  lung  tumors  and  airways  compared  to  normal  lung  tissue  ( p<0.0001 )  (Figures 
13A-B).  Immunohistochemical  analysis  confirmed  the  array  and  QRTPCR  findings  at  the  protein 
level  and  demonstrated  increased  total  MET  immunohistochemical  expression  in  tumor  and 

normal  bronchial  epithelial  histological 
tissue  sections  compared  to  those  of 
normal  lung  tissue  (Figure  13C).  In 
addition  and  importantly,  we  found 
statistically  significant  correlation 
(p<0.001)  between  MET  mRNA 
assessed  by  QRTPCR  and  protein 
expression  analyzed  by  IHC  in  the 
same  samples  studied  (Figure  13D). 
Our  microarray  analysis  also 
pinpointed  to  a  decreased  expression 
of  the  lung-specific  tumor  suppressor 
gene  G-protein  coupled  receptor 
family  C,  group  5,  member  A 
(■ GPRC5A )  in  tumors  and  airways 
compared  to  uninvolved  normal  lung 
tissue.  We  performed  QRTPCR 
analysis  of  GPRC5A  mRNA 
expression  in  NSCLCs,  normal  lung 
tissue  and  bronchial  epithelia  from  six 
independent  cases  we  had  not  profiled.  QRTPCR  analysis  confirmed  the  microarray  findings 
and  demonstrated  that  GPRC5A  expression  was  significantly  decreased  in  the  molecular 
localized  field  cancerization  (Figures  14A-B).  GPRC5A  was  identified  as  a  lung-specific  tumor 
suppressor  gene  evidenced  by  spontaneous  adenocarcinoma  formation  in  mice  with  knockout 
of  both  of  the  gene’s  alleles  {15).  More  recently,  GPRC5A  was  shown  to  exert  its  tumor 
suppressive  function,  in  part,  by  inhibition  of  nuclear  factor-kappa  B  (NFkB)  and  downstream 
inflammation  {16).  Therefore,  we  sought  to  analyze  GPRC5A  expression  in  the  molecular  field 
cancerization  associated  with  chronic  obstructive  pulmonary  disease  (COPD),  a  risk  factor  for 

lung  cancer  that  is 
typically  associated 
with  inflammation 
{17).  GPRC5A 

expression  was 
assessed  in 

histological  tissue 
specimens  of  NSCLC 
and  normal  bronchial 
epithelia  including 
those  from  lung 
cancer-free  COPD 
and  from  NSCLC 
patients. 
Representative 
photomicrographs  of 
GPRC5A 


N:  13  18  24  14  12 

Figure  15.  GPRC5A  expression  in  COPD-associated  field  cancerization.  A.  Representative 
photomicrographs  of  GPRC5A  expression  in  NBE  from  disease-free  never-  and  ever-smokers 
(upper  left  and  right,  respectively)  and  in  NBE  from  cancer-free  COPD  patients  (bottom  left)  and 
NSCLC  patients  with  COPD  (bottom  right).  B.  Box-plots  depicting  quantification  of  GPRC5A 
immunohistochemical  expression  represented  in  (A).  Main  P-value  signifies  statistical 
significance  of  reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD- 
associated  NSCLC  and  among  the  five  groups  by  the  general  linear  model  and  Kruskal-Wallis 
test. 


■  NSCLC 

□  Normal  lung  parenchyma 

■  Bronchial  structures 


Figure  14.  QRTPCR  analysis  of  GPRC5A  expression  in  the  localized 
field  cancerization.  Total  RNA  isolated  from  tumors,  normal  lung  tissue 
and  normal  bronchial  epithelia  from  six  NSCLC  cases  (A)  was  analyzed 
by  QRTPCR  for  GPRC5A  expression.  B.  Box  plot  showing  average 
GPRC5a  expression  in  NSCLCs,  bronchial  epithelia  and  normal  lung 
tissue  statistically  analyzed  by  the  Kruskal-Wallis  test. 
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immunohistochemical  staining  in  the  different  normal  bronchial  epithelia  is  depicted  in  Figure 
15A.  GPRC5A  immunoreactivity  appeared  to  be  highest  in  normal  bronchial  epithelia  from  lung 
disease-free  never-  and  ever-smokers  and  successively  significantly  decreased  in  NBE  of 
COPD  patients  who  are  cancer-free  and  lowest  in  NBE  of  patients  with  both  COPD  and 
adenocarcinoma  or  SCC  (Figure  15B).  General  linear  model  demonstrated  a  significant  gradual 
decrease  of  GPRC5A  expression  from  NBE  of  disease-free  never-smokers  to  NBE  from 
patients  with  COPD  and  adenocarcinoma  or  SCC  (p<0.0001)  (Figure  15B).  The  mean  (177.31 
±  30.66)  and  median  (195;  min,  130;  max,  220)  of  GPRC5A  expression  in  NBE  of  lung  disease- 
free  never-smokers  were  higher  than  that  of  smokers  (mean,  155.28  ±  34.15,  median,  147.5; 
min,  100;  max,  230)  although  the  difference  was  not  statistically  significant  (Figure  15B). 
Moreover,  GPRC5A  expression  was  significantly  higher  in  NBE  of  lung  disease-free  smokers 
relative  to  NBE  in  cancer-free  COPD  patients  (p=0.007),  which  in  turn  was  significantly  higher 
compared  to  the  expression  in  NBE  from  COPD  patients  with  adenocarcinoma  (p<0.0001)  or 
SCC  (p<0.0001)  (Figure  15B).  Furthermore,  although  GPRC5A  expression  was  lower  in  NBE  of 
COPD  patients  with  SCC  compared  to  those  with  COPD  and  adenocarcinoma,  the  difference 
was  not  statistically  significant  (Figure  15B).  These  findings  suggest  that  reduced  expression  of 
the  GPRC5A  tumor  suppressor  may  be  implicated  in  the  pathogenesis  of  NSCLC  associated 
with  inflammatory  COPD.  Our  findings  on  the  differential  expression  of  GPRC5A  in  the 
molecular  field  cancerization  including  that  associated  with  COPD  have  been  accepted  for 
publication  and  are  in  press  (see  Reportable  Outcomes  and  Appendix). 

Functional  studies:  As  mentioned 
above,  we  confirmed  and  validated 
the  differential  expression  of  selected 
field  cancerization  markers  in 
matched  NSCLCs,  bronchial  epithelia 
and  uninvolved  normal  lung  tissue. 

Some  of  the  above  field  cancerization 
markers  exhibit  well  reported  roles  in 
lung  cancer  pathogenesis  (e.g.,  MET, 

GPRC5A)  whereas  other  have  either 
been  recently  shown  to  function  in 
lung  carcinogenesis  [TGFBR2  (18, 

19)]  or  have  not  been  reported  to 
exhibit  aberrant  function  in  the 
pathogenesis  of  lung  malignancies 
(e.g.,  LAPTM4B,  NET02).  We  were 
prompted  to  assess  the  expression 
and  potential  functional  relevance  of 
the  latter  group  of  under  studied 
genes  in  an  in  vitro  model  of  lung  carcinogenesis  comprised  of  normal  (NHBE),  immortalized 
(BEAS-2B  and  1799)  and  transformed  (1198)  and  tumorigenic  (1170)  bronchial  and  lung 
epithelial  cells  (20). 

QRTPCR  analysis  demonstrated  that,  in  accordance  with  our  findings  on  differential  expression 
patterns  in  the  molecular  field  cancerization,  LAPTM4B  and  NET02  were  both  up-regulated  in 
tumorigenic  lung  epithelial  cells  compared  to  their  normal  counterparts  whereas  TGFBR2  was 
down-regulated  in  the  tumorigenic  cells  (Figures  16A-C).  Moreover,  the  differential  expression 
of  NET02  and  TGFBR2  in  the  utilized  in  vitro  model  was  confirmed  at  the  protein  level  by 
western  blot  analysis  (Figure  16D).  These  findings  demonstrate  that  novel  field  cancerization 
markers  may  have  important  relevance  to  lung  cancer  pathogenesis.  Moreover,  our  field 
cancerization  profiling  confirm  recent  reports  by  other  groups  (18,  19)  that  shed  light  on  the 


TGFBR2 


NHBE  BEAS-2B  1799  1198  1170 

Figure  16.  Differential  expression  of  selected  field  cancerization 
markers  in  an  in  vitro  model  of  lung  carcinogenesis.  QRTPCR 
analysis  of  LAPTM4B  (A),  NET02  (B)  and  TGFBR2  (C)  expression  in 
normal  (NHBE),  immortalized  (BEAS-2B  and  1799)  and  transformed  and 
tumorigenic  (1198  and  1170)  lung  epithelial  cells.  D.  Western  blot  analysis 
of  NET02  and  TGFBR2  protein  expression  in  the  in  vitro  model. 
Membranes  were  stained  with  antibody  against  actin  to  ensure  equal 
loading. 
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function  of  new  genes  in  lung  cancer  pathogenesis  and,  thus,  provide  confidence  in  functionally 
exploring  field  cancerization  markers  we  had  identified  and  that  have  not  been  reported  to  be 
implicated  in  lung  carcinogenesis. _ 

Microarray 


LAPTM4B 
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associate  transmembrane 
putative  oncogene  with  no 
known  role  in  lung 
carcinogenesis  (21),  was 
among  the  top  5  site- 
dependent  field  cancerization 
markers.  Moreover  and  in  our  hands,  LAPTM4B  was  significantly  elevated  in  NSCLC  and 
immortalized  bronchial  epithelial  cell  lines  compared  to  normal  cells.  We  were  then  incited  to 
explore  LAPTM4B  expression  in  a  large  available  array  dataset  of  NSCLC  cell  lines  from  Dr. 
John  D.  Minna  (University  of  Texas 


Figure  17.  LAPTM4B  expression  in  NSCLC  and  immortalized  bronchial 
epithelial  cell  lines.  Increased  expression  of  LAPTM4B  in  NSCLC  cells  compared 
to  HBECS  by  microarray  analysis.  Columns  represent  samples  and  rows  represent 
two  LAPTM4B  probe  sets  (red,  up-regulated;  green,  down-regulated). 


analysis  of  the 
field  cancerization  in  early- 
stage  NSCLC  revealed  that 
LAPTM4B,  a  lysosome 


siLAPTM4B 


24  48 
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Figure  18.  Knockdown  of  LAPTM4B  reduces  lung  cancer  cell 
growth.  A.  QRTPCR  analysis  of  LAPTM4B  expression  in  Calu6 
cells  transfected  with  control  siRNA  or  LAPTM4B- specific  siRNA. 
B.  Trypan  blue  exclusion  count  of  cells  at  24,  48  and  72  h  following 
transfection  of  control  or  L4P77W4B-targeting  siRNA. 


HI  650 


Southwestern,  Dallas,  Texas).  Microarray 
analysis  of  the  cell  lines  demonstrated  that 
LAPTM4B  expression  was  significantly 
higher  (left  cluster)  in  a  fraction  of  NSCLC 
cell  lines  compared  to  bronchial  cells  (right 
dendogram  cluster)  (Figure  17).  These 
findings  utilizing  different  cell  culture 
systems  suggest  that  field  cancerization 
markers,  exemplified  by  LAPTM4B,  may  be 
modulated  in  expression  in  the 
pathogenesis  sequence  of  lung  cancer. 

We  then  attempted  to  preliminarily  assess 
the  functional  relevance  of  the  field 
cancerization  marker  LAPTM4B  to  NSCLC  malignant  phenotype.  Microarray  analysis  of  the 

large  array  of  NSCLC  cell  lines  (Figure  17)  aided 
us  to  select  cell  lines  with  relatively  high 
expression  of  the  gene.  We  employed 
SMARTpool  small  interfering  RNA  (Dharmacon, 
Inc.,  Lafayette,  CO)  that  consist  of  pooled  siRNAs 
targeting  different  regions  of  the  LAPTM4B 
transcript.  Transient  transfection  of  Calu-6  NSCLC 
cells  with  MP77W4B-targeting  siRNA  effectively 
reduced  LAPTM4B  expression  (Figure  18A)  and 
significantly  decreased  cell  growth  evidenced  by 
trypan  blue  exclusion  cell  count  (Figure  18B).  The 
effect  of  RNA  interference-mediated  knockdown  of 
LAPTM4B  expression  on  cell  growth  was 
confirmed  in  another  NSCLC  cell  line  (HI  650, 
Figures  19A,  C  and  E)  and  in  the  immortalized 
bronchial  epithelial  cell  line  BEAS-2B  (Figures  19 
B,  D  and  F)  further  demonstrating  that  LAPTM4B 
is  a  positive  mediator  of  cell  growth  in  NSCLC  cell 
lines. 

We  sought  to  further  confirm  the  role  of  LAPTM4B 
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Figure  19.  Knockdown  of  LAPTM4B  reduces  cell  growth. 
QRTPCR  (A-B)  and  cell  growth  analysis  by  trypan  blue  exclusion 
(C-D)  of  NSCLC  and  immortalized  bronchial  epithelial  cells 
transfected  with  control  or  LAPTM4B  specific  siRNA.  Phase- 
contrast  micrographs  (x20)  of  control  and  LAPTM4B-s\RNA 
transfected  HI 650  (E)  and  BEAS-2B  (F)  cells. 
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in  lung  cancer  cell  growth  by 
generating  sub-lines  with  stable 
knockdown  of  the  gene  in  the 
Calu-6  cell  line  with  high  basal 
level  of  LAPTM4B.  Empty  vectors 
and  vectors  containing  scrambled 
small  hairpin  RNA  (shRNA)  as  well 
as  shRNA  specific  to  LAPTM4B 
(Origene,  Rockville,  MD)  were 
used  for  generation  of  the  stable 
sub-lines.  Stable  selection  was 
performed  by  incubating 
transfected  cells  for  two  weeks  in 
cell  culture  medium  with  10  pg 
puromycin.  QRTPCR 

demonstrated  significantly  reduced 
LAPTM4B  expression  in  Calu-6 
cells  with  stable  knockdown  of 
LAPTM4B  compared  to  cells  stably 
transfected  with  scrambled  shRNA 
or  empty  vectors  (Figure  20A). 

Moreover,  Calu-6  cells  with  stable 
knockdown  of  LAPTM4B  displayed 
significantly  reduced  cell  growth 
and  anchorage-dependent  colony  formation  (Figure  20B  and  Figures  20C-D,  respectively). 
These  data  demonstrate  that  LAPTM4B  field  cancerization  marker  is  a  positive  mediator  of 
immortalized  and  malignant  lung  epithelial  cell  growth.  Moreover,  these  findings  are  proof  of 
principle  and  demonstrate  that  functional  characterization  of  field  cancerization  markers  in  the 
lung  airway  can  lead  to  the  identification  of  otherwise  unknown  functional  targets  and  mediators 
of  lung  cancer  molecular  pathogenesis.  Our  findings  on  the  aforementioned  microarray  analysis 
of  the  field  cancerization  as  well  as  on  the  expression  validation  and  functional  studies  of  field 
cancerization  markers  have  been  presented  as  an  abstract  and  are  currently  being  prepared  as 
a  manuscript  for  publication  which  we  anticipate  submitting  for  peer-review  by  the  end  of  the 
year  (see  Reportable  Outcomes  and  Appendix). 

C.  Collection  of  epithelial  samples  from  both  bronchoscopy  and  lobectomy  specimens 
from  patients  with  lung  cancer  (Sub-specific  Aims  1 A  and  1C): 

We  have  collected  a  complete  set  of  bronchoscopy  (nasal  and  3  bronchial  brushes)  and 
lobectomy  (tumor,  normal  parenchyma,  and  3-5  bronchial  brushes)  epithelial  samples  in  7 
patients.  Along  with  other  cases/collections  from  Boston  University  (BU)  (Partnering  PI,  Avrum 
Spira),  University  of  California,  Los  Angeles  (UCLA)  (Initiating  PI,  Steve  Dubinett)  and 
Vanderbilt  (Partnering  PI,  Pierre  Massion),  the  samples  are  currently  being  analyzed  by  both 
next  generation  RNA-sequencing  (RNA-Seq)  using  the  lllumina  HiSeq  2000  platform  and 
microarray  profiling  using  the  Human  Gene  2.0  ST  platform  from  Affymetrix.  RNA-Seq  and 
microarray  analysis  are  being  performed  at  BU  and  MD  Anderson  Cancer  Center,  respectively. 
The  study  was  uniquely  designed  by  collecting  tumor,  normal  lung  tissue  and  airway  samples 
from  cases  with  NSCLC  obtained  from  all  four  participating  institutions  (Table  1).  Samples  from 
cases  without  lung  cancer  were  collected  and  processed  from  BU,  UCLA  and  Vanderbilt.  Total 
RNA  from  all  samples  in  the  different  institutions  were  isolated  similarly  using  the  miRNeasy  kit 
from  Qiagen  according  to  the  manufacturer’s  instructions.  Hiseq  RNA  sequencing  will  facilitate 
the  discovery  of  novel  transcripts  in  the  molecular  field  of  injury  as  well  quantifying  expression  of 
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Figure  20.  Stable  knockdown  of  LAPTM4B  reduces  lung  cancer  cell 
growth  and  colony  formation.  A.  QRTPCR  analysis  of  Calu-6  cells  stably 
transfected  with  empty  vectors  or  vectors  containing  scrambled  shRNA  or 
LAPTM4B-spec\Uc  shRNA.  B.  Trypan  blue  exclusion  method  count  of  the 
stably  transfected  cells.  C-D.  Anchorage-dependent  colony  formation  assay  of 
the  stably  transfected  Calu-6  cells  was  performed  by  seeding  cells  at  a 
seeding  density  of  500  cells  per  well  in  six-well  plates.  Representative 
triplicate  images  were  obtained  with  phase-light  microscope  (C)  and  quantified 
to  assess  number  of  colonies  (D). 
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those  that  could  not  be  characterized  by  microarray  technology.  In  addition  and  in  contrast  to 
the  Human  Gene  1.0  ST  microarray  platform  used  for  the  field  cancerization  profiling  described 
above,  the  newer  2.0  ST  platform  to  be  utilized  will  enable  the  interrogation  of  the  complete  set 
of  known  functionally  important  (22)  long  intergenic  non-coding  RNAs  (lines)  in  the  molecular 
field  of  injury.  To  date  this  is  the  first  study  that  is  1)  performing  next  generation  sequencing  in 
addition  to  microarray  profiling  analysis  of  the  molecular  filed  of  injury;  2)  applying  multiple  high 
throughput  technologies  on  field  of  injury  samples  obtained  from  four  different  institutions  in  the 
nation  and  3)  cross-comparing  the  complete  topological  map  of  the  extended  and  localized 
molecular  field  of  injury/cancerization  between  both  NSCLC  patients  and  cancer-free 
individuals.  We  anticipate  that  RNA-Seq  and  microarray  profiling  will  be  completed  by  the  end  of 
the  year  with  subsequent  bioinformatic  and  functional  analysis  along  with  validation  of 
expression  studies  completed  by  Spring  2013.  Due  to  the  use  of  both  next  generation  RNA 
sequencing  and  comprehensive  microarray  profiling  and  due  to  this  ongoing  study’s  unique 
design  we  anticipate  that  expression  profiles  in  the  NSCLC  molecular  field  of  injury  will  harbor 
molecules,  both  novel  and  established,  that  may  exhibit  potential  for  use  as  airway  biomarkers 
that  can  be  developed  and  tested  for  lung  cancer  detection  using  minimally  invasive  sites  in 
Specific  Aim  3  of  this  award. 

Table  1.  Molecular  mapping  of  the  field  of  injury  in  NSCLC  and  cancer-free  patients 


RNA-Seq  (cases) 

Microarray  (cases) 

Institution 

ADC  SCC  No  Cancer 

ADC  SCC  No  Cancer 

MD  Anderson 

4  2  0 

4  3  0 

BU 

2  13 

0  2  4 

UCLA 

2  2  1 

3  2  2 

Vanderbilt 

111 

4  3  1 

Number 

9  6  5 

11  10  7 

Total  Nb.  of  cases 
analyzed 

20 

28 

Total  Nb.  of  samples 
analyzed 

156 

183 

RNA-Seq,  RNA  sequencing;  ADC,  adenocarcinoma;  SCC,  squamous  cell  carcinoma;  BU, 
Boston  University;  UCLA,  University  of  California  Los  Angeles. 


Specific  Aim  2:  To  increase  our  understanding  of  the  role  of  tumor-initiating 
stem/progenitor  cells  in  the  pathogenesis  of  lung  cancer  in  the  “field 
cancerization”  that  develops  in  current  and  former  smokers. 

Summary  of  Research  Findings: 

Dr.  Wistuba’s  lab  is  not  part  of  this  Aim. 

Specific  Aim  3:  Test  airway-based  mRNA  and  microRNA  biomarkers  of  diagnosing 

lung  cancer  in  current  and  former  smokers  at  high  risk  for  lung 
cancer  in  minimally  invasive  sites. 


Summary  of  Research  Findings: 


17 


Army  Award  W81XWH-10-1-1007;  Partnering  PI:  Ignacio  Wistuba,  M.D. 
Annual  Report:  Reporting  Period  20  Sept  201 1-19  Sept  2012 _ 


The  studies  on  this  Aim  will  be  carried  out  in  Years  3  and  4  of  the  grant. 

KEY  RESEARCH  ACCOMPLISHMENTS 

•  Identified  aberrant  activation  of  canonical  oncogenes  in  the  molecular  field  of  injury  of  early 
stage  NSCLC  patients  including  phosphorylated  AKT  and  ERK  kinases. 

•  Completed  the  analysis  and  characterization  of  the  molecular  localized  field  cancerization 
using  194  samples  comprised  of  tumor,  normal  lung  and  airway  samples  from  20  NSCLC 
cases. 

•  Derived  field  cancerization  expression  signatures,  comprised  of  genes  concordantly  and 
significantly  differentially  expressed  between  tumors  and  airways  compared  to  matched 
normal  lung  tissue,  and  pertinent  to  NSCLC,  lung  adenocarcinomas  and  SCCs. 

•  In  collaboration  with  BU  (Partnering  institution,  demonstrated  that  molecular  profiles  in  the 
localized  field  cancerization  are,  at  least  in  part,  relevant  to  the  molecular  field  of  injury  and 
contain  markers  significantly  and  concordantly  different  between  airways  of  patients  with 
and  without  lung  cancer 

•  Identified  profiles  that  were  significantly  progressively  and  differentially  expressed  by 
distance  from  corresponding  lung  tumors  and  concordantly  modulated  between  tumors  and 
paired  uninvolved  normal  lung  tissues,  pinpointing  to  their  probable  roles  in  pathogenesis. 

•  Confirmed  the  differential  expression  of  several  site-dependent  and  -independent  markers 
identified  by  microarray  analysis  of  the  field  cancerization  at  the  RNA  and  protein  level  by 
QRTPCR  and  immunohistochemical  analysis,  respectively. 

•  Identified  previously  unknown  promalignant  roles  for  LAPTM4B,  one  of  the  top  5  site- 
dependent  field  cancerization  markers  we  had  found,  in  NSCLC  evidenced  by  reduced 
clonogenic  growth  of  cells  with  knockdown  of  the  gene. 

•  In  collaboration  with  the  Partnering  Pis  and  Initiating  PI  of  this  award,  began  to  analyze  by 
RNA-Seq  at  BU  (20  cases,  156  samples)  and  comprehensive  microarray  profiling  at  MD 
Anderson  (28  cases,  183  samples)  the  molecular  field  of  injury  in  NSCLC  patients  and 
cancer-fee  individuals. 


REPORTABLE  OUTCOMES 


Abstracts: 

•  Kadara  H,  Fujimoto  J,  Yoo  SY,  Garcia  M,  Kabbout  M,  Basey  A,  Wang  J,  Coombes  KR,  Kim 
ES,  Hong  WK,  Kalhor  N,  Moran  C,  Wistuba  II.  Gene  expression  profiling  of  lung  tumors  and 
matched  normal  airways  reveals  common  and  disparate  aberrant  pathways  in  squamous 
cell  carcinoma  and  adenocarcinoma  development  and  potential  targets  for  chemoprevention 
in  early  stage  lung  cancer  patients.  Proceedings  of  the  103rd  Annual  Meeting  of  the 
American  Association  for  Cancer  Research;  2012  Mar  31-  Apr  4;  Chicago,  Illinois. 
Philadelphia  (PA):  AACR;  201 2.  Abstract  #1 721 . 

Manuscripts: 

•  Kadara  H,  Wistuba  II.  Field  cancerization  in  NSCLC:  implications  in  disease  pathogenesis. 
Proceedings  of  the  American  Thoracic  Society,  9(2):38-42  2012. 
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CONCLUSIONS 


During  our  second  year  of  research,  we  derived  expression  profiles  signifying  the  molecular  field 
cancerization  in  early-stage  NSCLC.  We  also  showed  that  the  field  cancerization  profiles, 
identified  in  distal  airways  of  resected  specimens,  are,  at  least  in  part,  relevant  to  the  extended 
field  of  injury  and  differentially  expressed  between  large  airways  of  patients  with  and  without 
lung  cancer.  In  addition,  our  profiling  studies  revealed  expression  patterns  in  bronchial  epithelia 
that  are  significantly  progressively  modulated  with  distance  from  corresponding  tumors  and, 
importantly,  concordantly  modulated  between  tumors  and  paired  uninvolved  normal  lung 
tissues,  pinpointing  their  probable  roles  in  pathogenesis.  In  addition,  based  on  functional 
pathways  analysis  and  statistical  threshold  criteria,  we  validated  the  differentially  expression  of 
several  key  site-dependent  and-independent  field  cancerization  markers.  Furthermore,  we 
found  that,  LAPTM4B,  a  putative  oncogene  with  no  known  role  in  lung  carcinogenesis  and  one 
of  top  5  site-dependent  field  cancerization  markers  that  increased  with  shorter  distance  from 
tumors  in  our  profiling  studies,  was  a  positive  mediator  of  the  malignant  phenotype.  Current 
efforts  are  underway  to  profile  and  RNA-Seq,  along  with  the  other  Pis  of  the  award,  the 
complete  topological  map  of  the  field  of  injury  in  both  NSCLC  patients  and  cancer-free 
individuals  which  are  expected  to  yield  airway  biomarkers  for  lung  cancer  to  be  tested  in  future 
aims  of  the  study. 
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Gene  expression  profiling  of  lung  tumors  and  matched  normal  airways  reveals  common  and 
disparate  aberrant  pathways  in  SCC  and  adenocarcinoma  development  and  potential  targets  for 
chemoprevention  in  early  stage  NSCLC  patients. 
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Chemoprevention  of  non-small  cell  lung  cancer  (NSCLC)  has  been  unsuccessful  in  part  due  to  our  limited 
knowledge  of  its  pathogenesis.  It  has  been  suggested  that  normal  airway  epithelia  share  molecular 
abnormalities  with  tumors  and  may  serve  as  progenitors  for  lung  malignancies.  We  sought  to  analyze 
molecular  profiles  of  lung  adenocarcinomas  and  squamous  cell  carcinomas  (SCCs),  two  major  subtypes 
of  NSCLC,  and  matched  normal  airways  to  elucidate  aberrant  expression  patterns  in  early  phases  of  lung 
tumorigenesis.  All  specimens  were  obtained  from  primary  early  stage  NSCLC  consented  patients  who  did 
not  receive  neoadjuvant  therapy  (n=20).  We  profiled  RNA  isolated  from  tumors  and  normal  lung  as  well 
as  from  brushings  of  multiple  airways  adjacent  to  NSCLC  that  were  histopathologically  confirmed  to  lack 
neoplastic  or  preneoplastic  cells  (n=194).  Expression  signatures  signifying  genes  that  were  significantly 
and  concurrently  differentially  expressed  between  both  tumors  and  airways  compared  to  normal  lung 
tissue  (tumor-airway-normal/TAN  signatures)  were  then  derived  independently  for  SCC  and  smoker 
adenocarcinoma  cases  and  were  comprised  of  1,803  and  1,938  genes,  respectively.  The  TAN  signatures 
effectively  clustered  tumor  and  airways  from  matched  normal  lung  samples  (p<0.001).  Further  analysis 
showed  that  a  subset  of  the  genes  separated  SCC-  and  adenocarcinoma-adjacent  airways.  Moreover, 
pathways  and  gene-network  analysis  using  Ingenuity  pathways  software  highlighted  similarities  and 
differences  in  pathway  modulation  between  airway  epithelial  fields  of  SCCs  and  adenocarcinomas. 
Embryonic  stem  cell  and  eicosanoid  signaling  pathways  were  most  significantly  modulated  among  those 
common  to  both  TAN  signatures  (p<0.001).  Importantly,  retinoic  acid  receptor  and  stem  cell  signaling 
pathways  mediated  by  NANOG  and  lineage-specific  oncogene,  SOX2,  were  most  significantly  modulated 
in  the  SCC  TAN  signature,  whereas  NF-kB  and  PTEN  signaling  pathways  were  most  prevalent  in  the 
adenocarcinoma  TAN  airway  signature  (ah  p<0.001).  Gene  networks  mediated  by  decreased  expression 
of  lineage-specific  oncogene,  NKX2-1/TITF-1,  and  increased  expression  of  TP63  and  networks  mediated 
by  increased  expression  of  the  MET  and  ERBB2  oncogenes  were  predominantly  functionally  modulated 
in  the  SCC  and  adenocarcinoma  TAN  signatures,  respectively.  Quantitative  PCR  analysis  confirmed  the 
up-regulation  of  the  MET  oncogene  in  adenocarcinomas  and  normal  airways  compared  to  normal  lung. 
These  findings  highlight  expression  patterns  and  pathways  that  are  deregulated  differentially  in  the 
pathogenesis  of  lung  adenocarcinomas  and  SCCs  and  therefore  offer  therapeutic  targets  to  guide 
personalized  chemoprevention  in  early  stage  NSCLC  patients.  Supported  by  DoD  W81XWH-10-1-1007. 
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Lung  cancer,  of  which  non-small  cell  lung  cancer  (NSCLC)  composes 
the  majority,  is  the  leading  cause  of  cancer-related  deaths  in  the 
United  States  and  worldwide.  NSCLCs  are  tumors  with  complex  bi¬ 
ology  that  we  have  recently  started  to  understand  with  the  advent 
of  various  histological,  transcriptomic,  genomic,  and  proteomic 
technologies.  However,  the  histological  and  molecular  pathogene¬ 
sis  of  this  malignancy,  in  particular  of  adenocarcinomas,  is  still 
largely  unknown.  Earlier  studies  have  highlighted  a  field  canceriza¬ 
tion  phenomenon  in  which  histologically  normal-appearing  tissue 
adjacent  to  neoplastic  and  pre-neoplastic  lesions  display  molecular 
abnormalities,  some  of  which  are  in  common  with  those  in  the 
tumors.  This  review  will  summarize  advances  in  understanding  the 
field  cancerization  phenomenon  and  the  potential  relevance  of  this 
knowledge  to  gain  important  and  novel  insights  into  the  molecular 
pathogenesis  of  NSCLC  as  well  as  to  subsequent  development  of 
biomarkers  for  early  detection  of  lung  cancers  and  possibly  person¬ 
alized  prevention. 

Keywords:  lung  cancer;  field  cancerization;  pathogenesis;  airway  epi¬ 
thelium 

Lung  cancer  is  the  leading  cause  of  cancer  deaths  in  the  United 
States  and  worldwide  in  both  developing  and  developed  regions 
(1).  The  high  mortality  of  this  disease  is  in  part  due  to  our 
lacking  knowledge  of  the  molecular  mechanisms  governing  lung 
cancer  pathogenesis  as  well  as  the  late  diagnosis  of  the  majority 
of  lung  cancers  after  regional  or  distant  spread  of  the  malig¬ 
nancy  (2).  Non-small  cell  lung  cancer  (NSCLC)  represents  the 
majority  of  diagnosed  lung  cancers  (2)  and  is  mainly  composed 
of  squamous  cell  carcinomas  (SCCs)  and  lung  adenocarcinomas 
(2,  3).  Several  major  differences  exist  between  adenocarcinomas 
and  SCCs.  For  example,  compared  with  SCCs  that  arise  from 
the  major  bronchi  and  are  mainly  centrally  located,  pulmonary 
adenocarcinomas  arise  from  small  bronchi,  bronchioles,  or  al¬ 
veolar  epithelial  cells  and  are  typically  peripherally  located,  as 
reviewed  elsewhere  (2-5).  Moreover,  whereas  SCC  pathogenesis 
is  strongly  linked  to  smoking,  lung  adenocarcinoma  is  the  more 
common  histological  subtype  in  never-smoker  patients  (6-9).  Al¬ 
though  the  sequence  of  lesions  in  the  pathogenesis  leading  to 
SCCs  is  well  described,  little  is  known  about  the  sequential  de¬ 
velopment  of  adenocarcinomas.  Moreover,  we  are  still  lacking  in 
our  knowledge  of  differential  mechanisms  of  molecular  patho¬ 
genesis  among  both  subtypes  of  NSCLC. 
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In  light  of  the  postulated  period  of  time  and  multiple  stages  re¬ 
quired  for  the  development  of  overt  epithelial  tumors,  it  is  plau¬ 
sible  to  assume  that  early  diagnosis  of  lung  cancer  or  intraepithelial 
lesions  coupled  with  effective  prevention  strategies  will  reduce  the 
significant  health  burden  associated  with  this  disease  (10).  Despite 
recent  encouraging  findings  from  the  National  Lung  Screening 
Trial  (NLST)  (11),  early  detection  and  prevention  of  lung  cancer 
is  challenging  due  to  the  lack  of  biomarkers  for  early  diagnosis  of 
the  disease  and  to  the  presence  of  multiple  neoplastic  molecular 
pathways  that  mediate  lung  carcinogenesis.  Earlier  studies  have 
highlighted  a  field  cancerization  phenomenon  in  which  histolog¬ 
ically  normal-appearing  tissue  adjacent  to  neoplastic  and  pre¬ 
neoplastic  lesions  displays  molecular  abnormalities,  some  of  which 
are  in  common  with  those  in  the  tumors.  It  is  plausible  to  assume 
that  understanding  early  events  in  tumor  development  that  com¬ 
mence  in  histologically  normal  epithelium  would  pave  the  way  for 
unmet  effective  and  personalized  strategies  for  lung  cancer  pre¬ 
vention  and  treatment.  This  review  mainly  summarizes  advances 
in  understanding  the  field  cancerization  phenomenon  and  the  po¬ 
tential  relevance  of  this  knowledge  to  gain  important  and  novel 
insights  into  the  molecular  pathogenesis  of  NSCLC  as  well  as  to 
subsequent  development  of  biomarkers  for  early  detection  of  lung 
cancers  and  possibly  personalized  prevention. 

NSCLC  PRE-NEOPLASIA  AND 
MOLECULAR  PATHOGENESIS 

From  biological  and  histopathological  perspectives,  NSCLC  is 
a  complex  malignancy  that  develops  through  multiple  pre-neoplastic 
pathways.  Lung  adenocarcinoma,  a  major  subtype  of  NSCLC,  has 
been  increasing  in  incidence  globally  in  both  smokers  and  non- 
smokers  (9),  with  a  concurrent  decrease  in  SCC  frequency.  It 
has  been  suggested  that  the  increasing  incidence  of  lung  adeno¬ 
carcinomas  compared  with  SCCs  is  in  part  due  to  the  change  in 
the  type  of  cigarettes  used  (lower  nicotine  and  tar)  and  smoking 
habits  and  behavior  (7).  Anatomical  differences  in  the  location  of 
diagnosed  lung  adenocarcinomas  and  SCCs  strongly  suggest  that 
both  NSCLC  subtypes  develop  through  different  histopathologi¬ 
cal  and  molecular  pathways  and  have  different  cells  of  origin; 
however,  the  specific  respiratory  epithelial  cell  type  from  which 
each  lung  cancer  type  develops  has  not  been  established  with 
certainty  (3).  However,  it  is  noteworthy  that  Clara  cells  and  the 
type  II  pneumocytes  are  believed  to  be  the  progenitor  cells  of  the 
peripheral  airways,  and  peripherally  arising  adenocarcinomas  of¬ 
ten  express  markers  of  these  cell  types  (12,  13). 

The  multistage  stepwise  fashion  of  tumor  development  has  been 
demonstrated  in  various  anatomical  organs  exemplified  by  the  op¬ 
erational  stages  occurring  during  skin  carcinogenesis  (14).  Carcino¬ 
genesis  of  the  skin  is  initiated  by  a  carcinogen-induced  mutational 
event,  promoted  by  clonal  outgrowth,  which  may  be  dependent  on 
tumor  promoters,  followed  by  progression  of  premalignant  lesions 
(e.g.,  papillomas)  and  their  conversion  to  malignant  tumors  (14). 
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Importantly,  earlier  genetic  studies  indicate  that  nonrandom, 
sequential  chromosomal  aberrations  (trisomies  of  chromosomes  6 
and  7)  are  associated  with  premalignant  progression  of  mouse  skin 
papillomas  (14-17).  The  sequence  of  histopathological  and  molec¬ 
ular  changes  in  bronchial  epithelia  that  precede  the  development 
of  lung  SCCs  have  been  characterized,  demonstrating  that  sequen¬ 
tially  occurring  allele-specific  molecular  changes  commence  in  dis¬ 
persed  foci  signifying  a  multistage  fashion  of  squamous  lung 
cancer  pathogenesis  (4,  18).  At  least  in  this  subtype  of  NSCLC, 
it  has  been  shown  that  specific  genomic  alterations  (mainly  3p21, 
3p22-24,  3p25,  and  9p21)  occur  in  histologically  normal  bronchial 
epithelia  from  resected  specimens  (18).  Moreover,  and  notably, 
these  alterations  persisted  in  hyperplasias,  dysplasias,  carcinomas 
in  situ ,  and  tumors  that  exhibited  different  commencing  genomic 
aberrations  (18).  In  addition,  loss  of  heterozygosity  (LOH)  in 
the  3p  region  was  demonstrated  in  normal  bronchial  epithelia  of 
cancer-free  smokers,  further  highlighting  the  early  role  this  specific 
genomic  alteration  exerts  in  lung  cancer  pathogenesis  (19,  20). 
It  is  noteworthy  that  McCaughan  and  colleagues  demon¬ 
strated  that  no  low-grade  lesions,  but  all  high-grade  lesions, 
exhibited  3q  amplification  targeting  the  sex  determining  re¬ 
gion  Y-box  2  ( SOX2 )  lineage-specific  oncogene  (21).  These 
previous  findings  and  reports,  through  highlighting  associations 
between  histopathological  sequences  and  specific  molecular 
aberrations,  pinpoint  to  a  multistage  and  multistep  manner  of 
lung  carcinogenesis. 

On  the  other  hand,  one  cannot  neglect  the  alternative  hypoth¬ 
esis  that  the  sequence  of  genetic  and  epigenetic  alterations  is 
irrelevant  to  lung  cancer  pathogenesis,  but  rather  the  accumula¬ 
tion  of  molecular  abnormalities  beyond  a  certain  threshold  medi¬ 
ates  development  of  the  malignant  phenotype.  It  has  been 
suggested  that  at  least  two  molecular  pathways,  the  Kirsten  rat 
sarcoma  viral  oncogene  (KRAS)  and  epidermal  growth  factor 
receptor  ( EGFR )  pathways,  are  involved  in  the  development 
of  smoker  and  never-smoker  adenocarcinomas,  respectively  (2, 
9).  Moreover,  and  as  reviewed  by  Yatabe  and  colleagues,  atyp¬ 
ical  adenomatous  hyperplasias  (AAHs),  which  are  considered  to 
be  precursor  lesions  for  peripheral  lung  adenocarcinomas  (3,  5) 
and  the  only  sequence  of  morphologic  change  identified  so  far 
for  the  development  of  invasive  lung  adenocarcinomas,  exhibit 
KRAS  mutations  more  frequently  than  invasive  adenocarcino¬ 
mas  (5).  Conversely,  our  group  has  previously  demonstrated  that 
EGFR  mutations  commence  in  histologically  normal  bronchial 
epithelia  adjacent  to  lung  adenocarcinomas  and  precede  copy 
number  increase  of  the  oncogene  (22,  23).  EGFR  mutations  also 
are  persistent  throughout  the  different  phases  of  lung  adenocar¬ 
cinoma  development  (5),  which  harbor  different  genomic  alter¬ 
ations  (24).  It  is  plausible  to  surmise  that  only  after  increasing 
our  knowledge  of  the  pre-neoplastic  changes  as  well  as  the  cor¬ 
responding  molecular  abnormalities  leading  to  the  development 
of  lung  adenocarcinomas  would  we  then  be  able  to  more  confi¬ 
dently  determine  whether  adenocarcinomas  follow  a  linear  pro¬ 
gression  mechanism  or  not  (5).  However,  and  based  on  the 
aforementioned  previously  reported  observations  by  our  group 
and  others,  we  believe  that  it  is  not  counterintuitive  to  speculate 
that  development  of  lung  malignant  phenotype,  including  that  of 
adenocarcinomas,  is  due  to  stepwise,  sequence-specific,  and  mul¬ 
tistage  molecular  pathogenesis  as  well  as  accumulation  and  com¬ 
bination  of  genetic  and  epigenetic  abnormalities. 

FIELD  CANCERIZATION 

Smoking-Damaged  Epithelium  and  the  Field 
Cancerization  Phenomenon 

Although  the  majority  of  lung  cancer  patients  are  current  or  for¬ 
mer  smokers,  a  relatively  small  proportion  of  these  smokers 


(approximately  15%)  develop  primary  lung  tumors.  Patients  with 
early-stage  NSCLC  commonly  exhibit  recurrence  or  second  pri¬ 
mary  tumor  development  after  definitive  treatment  by  resective 
surgery.  There  is  a  large  body  of  evidence  that  heavy  smokers 
and  patients  who  have  survived  lung  cancer  compose  a  high- 
risk  population  that  may  be  targeted  for  early  detection  and  che- 
moprevention  efforts  (4).  Although  the  risk  of  developing  lung 
cancer  decreases  after  smoking  cessation,  the  risk  never  returns 
to  baseline.  Pre-neoplastic  changes  have  been  used  as  surrogate 
endpoints  for  chemopreventive  studies.  However,  it  was  sug¬ 
gested  that  this  “shooting  in  the  dark’’  approach  may  explain 
the  reasons  behind  the  general  failures  of  clinical  chemopreven- 
tion  studies  (10).  Therefore,  novel  approaches  to  identify  the  best 
population  to  be  targeted  for  early  detection  and  chemopreven- 
tion  should  be  devised,  and  risk  factors  for  lung  cancer  develop¬ 
ment  or  relapse  need  to  be  better  defined.  For  these  important 
purposes,  a  better  understanding  of  the  biology  and  molecular 
origins  of  lung  cancer  is  warranted. 

Earlier  work  by  Danely  Slaughter  in  patients  with  oral  cancer 
and  oral  premalignant  lesions  has  suggested  that  histologically 
normal-appearing  tissue  adjacent  to  neoplastic  and  pre-neoplastic 
lesions  display  molecular  abnormalities,  some  of  which  are  in 
common  with  those  in  the  tumors  (25).  In  1961,  a  seminal  report 
by  Auerbach  and  colleagues  suggested  that  cigarette  smoke 
induces  extensive  histological  changes  in  the  bronchial  epithelia 
in  the  lungs  of  smokers  and  that  premalignant  lesions  are  wide¬ 
spread  and  multifocal  throughout  the  respiratory  epithelium, 
suggestive  of  a  field  effect  (26).  This  phenomenon,  coined  “field 
of  cancerization,”  was  later  shown  to  be  evident  in  various 
epithelial  cell  malignancies,  including  lung  cancer.  Some  degree 
of  inflammation  and  inflammatory-related  damage  is  almost  in¬ 
variably  present  in  the  central  and  peripheral  airways  of  smok¬ 
ers  and  may  precede  the  development  of  lung  cancer  (4).  Thus, 
the  field  of  cancerization  may  also  be  explained  by  both  the 
direct  effect  of  tobacco  carcinogens  and  the  initiation  of  inflam¬ 
matory  response.  In  this  context,  different  theories  for  the  ori¬ 
gin  of  the  field  cancerization  or  smoking-related  field  of  injury 
have  been  put  forward  and  extensively  reviewed  elsewhere  by 
Steiling  and  coworkers  (27). 

Multiple  altered  foci  of  bronchial  epithelium  are  present 
throughout  the  airway  in  patients  with  lung  cancer  and  smokers 
(18,  28,  29).  As  mentioned  before,  detailed  analysis  of  histol¬ 
ogically  normal,  premalignant,  and  malignant  epithelia  from 
patients  with  lung  SCC  indicated  that  multiple,  sequentially  oc¬ 
curring  allele-specific  chromosomal  deletions  of  LOH  commence 
in  clonally  independent  foci  early  in  the  multistage  pathogenesis 
of  SCCs  (18,  28).  Notably,  31%  percent  of  histologically  normal 
epithelium  and  42%  of  mildly  abnormal  (hyperplasia/metaplasia) 
specimens  had  clones  of  cells  with  allelic  loss  at  one  or  more 
regions  examined.  Moreover,  these  molecular  aberrations  were 
also  found  in  carcinomas  in  situ  and  SCCs  and  at  a  more  ad¬ 
vanced  level  (18).  Molecular  changes  involving  LOH  of  chromo¬ 
somal  regions  3p  (e.g.,  fragile  histidine  triad  gen elFEHT),  9p 
(e.g.,  CDKN2AJpl6 ),  genomic  instability  (increased  microsatel¬ 
lite  repeats),  and  pl6  methylation  have  been  demonstrated  in 
histologically  normal  bronchial  epithelia  in  patients  with  SCC 
and  in  the  sequence  of  pathogenesis  of  the  disease  (3).  Moreover, 
Nelson  and  colleagues  demonstrated  that  KRAS  is  also  mutated 
in  histologically  normal  lung  tissue  adjacent  to  lung  tumors  (30). 
In  addition,  similar  epigenetic  and  gene  methylation  patterns 
between  tumors  and  adjacent  histologically  normal  epithelia 
were  described.  Belinsky  and  colleagues  reported  aberrant  pro¬ 
moter  methylation  of  pl6,  which  was  described  to  be  commonly 
methylated  in  lung  tumors  (31),  in  at  least  one  bronchial  epithe¬ 
lial  site  from  44%  of  lung  cancer  cases  examined  (32).  Moreover, 
p!6  and  death-associated  protein  kinase  ( DAPK )  promoter 
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methylation  were  frequently  observed  in  bronchial  epithelium 
from  smoker  but  not  from  never-smoker  patients  with  lung  can¬ 
cer  and  persisted  after  smoking  cessation  (32). 

The  aforementioned  molecular  abnormalities  were  detected 
in  histologically  normal  epithelia  adjacent  to  archival  surgically 
resected  tumors  from  patients  with  primary  lung  cancer.  LOH 
and  microsatellite  alterations  in  multiple  foci  were  also  detected 
in  distal  histological  normal  bronchial  epithelia  of  smokers  with¬ 
out  cancer  (19,  20).  Moreover,  and  importantly,  these  molecular 
abnormalities  were  detected  in  bronchial  epithelia  of  cancer- 
free  former  smokers  and  appeared  to  have  persisted  for  many 
years  after  smoking  cessation.  In  addition,  LOH  was  detected  in 
DNA  obtained  from  bronchial  brushings  of  normal  and  abnor¬ 
mal  lungs  from  patients  undergoing  diagnostic  bronchoscopy 
and  was  detected  in  cells  from  the  ipsilateral  and  contralateral 
lung  (33).  Mutations  in  TP53  were  also  described  to  occur  in 
bronchial  epithelia  of  cancer-free  smokers  in  a  widely  dispersed 
manner  (34).  Similar  evidence  also  exists  for  promoter  methyl¬ 
ation  and  epigenetic  changes  in  smoking-damaged  lung  epithe¬ 
lium  of  cancer-free  patients.  Methylation  of  various  genes, 
including  retinoic  acid  receptor  2  p  (RAR-/32),  H-cadherin,  ad¬ 
enomatous  polyposis  coli  (A  PC).  pl6,  and  Ras  association 
(RalGDS/AF-6)  domain  family  member  1  ( RASSFF1A )  has 
been  described  in  bronchial  epithelial  cells  of  heavy  smokers 

(35) .  Moreover,  methylation  of  pl6 ,  glutathione  S-transferase 
pi  1  ( GSTP1 ),  and  DAPK  was  reported  to  be  evident  in  bron¬ 
chial  brushings  of  one-third  of  cancer-free  smokers  examined 

(36) .  In  the  study  by  Belinsky  and  colleagues,  as  mentioned  be¬ 
fore,  methylation  of  pl6  was  detected  in  epithelia  of  cancer-free 
smokers  (32).  A  more  detailed  list  of  aberrant  gene  promoter 
methylation  in  patients  with  lung  cancer  and  cancer-free  smok¬ 
ers  is  well  summarized  and  explained  in  the  review  by  Heller 
and  coworkers  (37). 


Transcriptomic  Studies  of  Lung  Field  Cancerization 

High-throughput  microarray  profiling  was  shown  to  be  useful  to 
study  the  transcriptome  of  lung  airways.  Hackett  and  colleagues 
studied  the  expression  of  44  antioxidant-related  genes  using 
bronchial  brushings  from  cancer-free  current  smokers  and 
never-smokers  and  found  significant  up-regulation  of  16  of  the 
antioxidant  genes  in  the  airways  of  smokers  compared  with  non- 
smokers  (38).  Later,  Spira  and  colleagues  described  global  alter¬ 
ations  in  gene  expression  between  normal-appearing  bronchial 
epithelium  of  healthy  cancer-free  smokers  and  that  of  non- 
smokers  (39).  Importantly,  irreversible  changes  in  expression 
in  airways  of  former  smokers  after  years  of  smoking  cessation 
were  described  that  were  believed  to  underlay  the  increased  risk 
former  smokers  exhibit  for  developing  lung  cancer  (39,  40). 
Alterations  in  the  expression  of  microRNAs  were  also  demon¬ 
strated  between  large  airways  of  current  and  never-smokers 
(41).  Notably,  an  80-gene  signature  was  derived  from  the  tran¬ 
scriptome  of  large  airway  epithelial  cells  that  can  distinguish 
smokers  without  overt  cancer  from  smokers  with  lung  cancer 
despite  originating  from  normal  bronchial  epithelia  (42).  More 
recently,  Gustafson  and  coworkers  derived  a  phosphoinositide- 
3-kinase  ( PI3K )  pathway  activation  signature  by  using  recombi¬ 
nant  adenoviruses  to  express  the  110a  subunit  of  PI3K  in  primary 
human  epithelial  cells  (43).  The  PI3K  pathway  activation  sig¬ 
nature  was  elevated  in  cytologically  normal  bronchial  airways  of 
smokers  with  lung  cancer  and,  importantly,  was  decreased  in  the 
airways  of  high-risk  smokers  whose  dysplastic  lesions  regressed 
after  treatment  with  the  PI3K  inhibitor  myoinositol  (43).  Micro¬ 
array  and  gene  expression  profiling  methodologies  were  also 
used  to  demonstrate  the  wide  anatomical  spread  of  the  lung 
field  cancerization.  Common  gene  expression  alterations  were 
identified  in  bronchial,  nasal,  and  buccal  epithelia  of  smokers 
(44),  and  in  a  separate  study,  the  expression  of  119  genes  was 
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analysis  of  the  field  canceriza¬ 
tion  in  non-small  cell  lung  can¬ 
cer  (NSCLC).  The  relevance  of 
the  lung  field  cancerization  to 
the  development  of  a  partic¬ 
ular  subtype  of  NSCLC  (i.e., 
adenocarcinoma  compared  with 
squamous  cell  carcinoma  [SCC]), 
is  still  unknown,  yet  possible.  An¬ 
alyzing  local  and  distant  field  of 
cancerization  by  analysis  of  the 
transcriptome  of  airway  brush¬ 
ings  from  multiple  sites  indepen¬ 
dently  for  lung  adenocarcinoma 
( yellow  spots)  and  SCC  ( red  spots) 
cases  may  shed  light  on  events 
common  or  unique  to  the  mo¬ 
lecular  pathogenesis  of  the  two 
major  subtypes  of  NSCLC.  A 
"compartmental"  approach  cou¬ 
pled  to  a  gradient  or  detailed 
molecular  mapping  method, 
which  spans  the  tumor  up  to 
the  nasal  or  buccal  epithelium, 
to  studying  the  field  of  canceriza¬ 
tion  may  unravel  biomarkers  that 
can  guide  personalized  preven¬ 
tion  strategies  suitable  for  each  different  NSCLC  subtype.  In  addition,  a  comparison  of  the  distant  field  cancerization  in  patients  with  cancer  (left)  to  the 
expression  patterns  of  the  corresponding  anatomical  location  in  disease-free  individuals  (e.g.,  high-risk  heavy  smokers;  right)  would  facilitate  the  development 
of  efficacious  markers  for  the  detection  of  NSCLC.  ADC  =  adenocarcinoma;  miRNA  =  microRNA. 
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demonstrated  to  be  affected  by  smoking  similarly  in  both  bron¬ 
chial  and  nasal  epithelium  (45). 

Field  Cancerization  Compartmentalization 

In  light  of  the  prevalence  of  mutations  in  the  EGFR  oncogene  in 
adenocarcinomas  and  in  particular  those  occurring  in  never- 
smokers,  Tang  and  colleagues  investigated  the  presence  of 
EGFR  mutations  in  normal  bronchial  and  bronchiolar  epithe¬ 
lium  adjacent  to  EGFR  mutant  tumors  (22).  EGFR  mutations 
were  detected  in  histologically  normal  peripheral  epithelia  in 
44%  of  patients  with  lung  adenocarcinoma  with  mutations  but 
none  in  patients  lacking  mutations  in  the  oncogene  (22).  More¬ 
over,  the  same  study  highlighted  more  frequent  EGFR  muta¬ 
tions  in  normal  epithelium  within  the  tumor  (43%)  than  in 
adjacent  sites  (24%),  suggesting  a  localized  field  effect  phenom¬ 
enon  for  this  abnormality  in  the  respiratory  epithelium  of  the 
lung  (22).  In  addition,  a  higher  frequency  of  mutations  in  cells 
obtained  from  small  bronchi  (35%)  compared  with  bronchioles 
(18%)  was  detected  (22).  More  recently,  EGFR  protein  over¬ 
expression,  similar  to  mutation  of  the  gene,  also  exhibited  a  lo¬ 
calized  field  effect,  as  it  was  more  frequent  in  normal  bronchial 
epithelia  sites  within  tumors  than  in  sites  adjacent  to  and  distant 
from  tumors  (23).  Interestingly,  EGFR  copy  number  alteration 
was  not  evident  in  normal  bronchial  epithelia,  which  is  in  ac¬ 
cordance  with  findings  that  EGFR  copy  number  is  a  relatively 
late  event  in  pathogenesis  of  adenocarcinomas  (5,  23).  These 
findings  suggest  that  adenocarcinomas  may  be  associated  with 
a  field  cancerization  dissimilar  from  that  linked  to  SCCs. 

The  low  frequency  of  molecular  abnormalities  detected  in  the 
centrally  located  bronchial  respiratory  epithelium  in  patients 
with  peripheral  lung  adenocarcinomas,  compared  with  specimens 
from  patients  with  SCCs  (28),  suggests  the  presence  of  two  com¬ 
partments  in  the  lung  with  different  degrees  of  smoking-related 
genetic  damage.  Thus,  smokers  who  develop  SCCs  display  more 
smoking-related  genetic  damage  in  the  respiratory  epithelium  of 
the  central  airway,  whereas  patients  who  develop  adenocarci¬ 
nomas  exhibit  molecular  and  histological  damage  mainly  in  the 
peripheral  airways.  Although  some  molecular  changes  (e.g., 
inflammation  and  signaling  pathways  activation)  have  been 
detected  throughout  the  lung  airway  and  include  both  compart¬ 
ments  (central  and  peripheral  airway),  other  aberrations  have 
been  more  frequently  altered  in  either  central  (e.g.,  LOH,  ge¬ 
netic  instability  evidenced  by  microsatellite  repeats)  or  periph¬ 
eral  (e.g.,  EGFR  mutations  as  mentioned  above)  airways.  These 
interesting  observations  indicate  a  possible  compartmentaliza¬ 
tion  of  field  cancerization  and  its  dissimilarity  between  adeno¬ 
carcinomas  and  SCCs,  which  may  well  reflect  the  differential 
mechanisms  of  pathogenesis  of  both  NSCLC  subtypes. 

FUTURE  DIRECTIONS  AND  CONCLUSION 

Applying  the  same  advanced  high-throughput  methodologies 
currently  used  in  studying  established  tumors  for  the  genetic 
analysis  of  lung  NSCLC  pre-neoplasia  and  histologically  normal 
adjacent  regions  is  expected  to  expand  our  understanding  of  the 
biology  of  this  prevalent  disease.  Next-generation  sequencing 
technology,  through  whole-genome,  whole-exome,  and  whole- 
transcriptome  approaches,  holds  great  promise  for  providing  in¬ 
valuable  insights  into  NSCLC  biology,  diagnosis,  prevention,  and 
therapy  (46).  An  important  step  in  this  direction  was  a  recent 
study  in  which  RNA  of  bronchial  airway  epithelial  cell  brush- 
ings  from  healthy  never-smokers  and  smokers  with  and  without 
lung  cancer  was  analyzed  by  RNA  sequencing  (47)  and  provided 
additional  insight  besides  that  provided  when  using  microarray 
technology. 


Earlier  findings  demonstrated  that  centrally  located  lung 
SCCs  and  peripherally  located  lung  adenocarcinomas  elicit 
and  perpetuate  differential  effects  on  the  airway  epithelia  (4). 
Changes  in  expression  in  the  lung  field  of  injury  have  been  shown 
to  be  similar  in  the  large  and  small  airways,  and  it  is  unknown 
whether  they  are  associated  with  the  development  of  the  partic¬ 
ular  subtype  of  NSCLC.  Addressing  this  question  may  be  highly 
important,  because  both  NSCLC  subtypes  display  different  ge¬ 
nomic  features  (2)  and,  therefore,  are  clinically  managed  by  sig¬ 
nificantly  dissimilar  treatment  strategies,  let  alone  differences 
among  various  subtypes  of  lung  adenocarcinomas.  Moreover, 
revisiting  the  field  cancerization  effect  using  a  compartmental 
coupled  with  a  gradient  or  detailed  molecular  mapping  approach 
in  patients  with  cancer  and  disease -free  individuals  (Figure  1) 
will  shed  light  on  events  in  the  early  pathogenesis  of  lung  adeno¬ 
carcinomas  and  SCCs  and  unravel  biomarkers  that  can  guide 
targeted  and  personalized  chemoprevention  strategies  suitable 
for  each  different  NSCLC  subtype  as  well  as  detection  efforts, 
in  particular  using  less  invasive  sites. 

Despite  numerous  efforts  that  have  centered  on  increasing  our 
understanding  of  the  biology  of  lung  cancer,  this  malignancy  still 
composes  the  biggest  share  of  cancer-related  deaths  in  the  United 
States  and  worldwide.  Compared  with  advances  in  targeted  and  per¬ 
sonalized  therapy  of  NSCLC,  little  progress  has  been  made  in  the 
tailored  prevention  of  this  fatal  malignancy.  This  may  change  with 
the  recent  encouraging  and  significant  findings  of  the  National  Lung 
Screening  Trial  (11).  Various  molecular  markers  and  expression 
classifiers  previously  described  in  the  lung  airways  and  in  less- 
invasive  sites  of  field  cancerization  (e.g.,  nasal  epithelium)  can  aid 
in  selecting  high-risk  individuals  best  suited  for  computed  tomog¬ 
raphy  screening,  for  example.  A  comprehensive  analysis  of  early 
molecular  events  in  NSCLC  pathogenesis  will  undoubtedly  unravel 
biomarkers  that  can  guide  future  chemoprevention  strategies. 

Author  disclosures  are  available  with  the  text  of  this  article  at  www.atsjournals.org. 
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Introduction:  Understanding  oncogenes  and  tumor  suppressor 
genes  expression  patterns  is  essential  for  characterizing  lung  can¬ 
cer  pathogenesis.  We  have  previously  demonstrated  that  m Gprc5a/ 
h GPRC5A  is  a  lung-specific  tumor  suppressor  evidenced  by  inflam¬ 
mation-mediated  tumorigenesis  in  Gprc5a-knockout  mice.  The 
implication  of  GPRC5A  in  human  lung  cancer  pathogenesis,  includ¬ 
ing  that  associated  with  inflammatory  chronic  obstructive  pulmonary 
disease  (COPD),  a  risk  factor  for  the  malignancy,  remains  elusive. 
Methods:  We  sought  to  examine  GPRC5A  immunohistochemi- 
cal  expression  in  histologically  normal  bronchial  epithelia  (NBE) 
from  lung  disease-free  never-  and  ever-smokers  (n  =  13  and  n  =  18, 
respectively),  from  COPD  patients  with  ( n  =  26)  and  without  cancer 
(n  =  24)  and  in  non-small  cell  lung  cancers  (NSCLCs)  ( n  =  474). 
Quantitative  assessment  of  GPRC5  A  transcript  expression  in  airways 
(n  =  6),  adjacent  NBEs  ( n  =  29)  and  corresponding  tumors  ( n  =  6) 
from  6  NSCLC  patients  was  also  performed. 

Results:  GPRC5A  immunohistochemical  expression  was  signifi¬ 
cantly  lower  in  tumors  compared  to  uninvolved  NBE  ip  <  0.000 1 )  and 
was  positively  associated  with  adenocarcinoma  histology  (p  <  0.00 1 ). 
GPRC5A  airway  expression  was  highest  in  lung  disease-free  NBE, 
decreased  and  intermediate  in  NBE  of  cancer-free  COPD  patients 
(p  =  0.004)  and  further  attenuated  and  lowest  in  epithelia  of  COPD 
patients  with  adenocarcinoma  and  SCC  ip  <  0.0001).  Furthermore, 
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GPRC5A  mRNA  was  significantly  decreased  in  NSCLCs  and  corre¬ 
sponding  NBE  compared  to  uninvolved  normal  lung  ip  =  0.03). 

Conclusions:  Our  findings  highlight  decreased  GPRC5A  expression 
in  the  field  cancerization  of  NSCLC,  including  that  associated  with 
lung  inflammation.  Assessment  of  the  use  of  GPRC5A  expression  as 
a  risk  factor  for  NSCLC  development  in  COPD  patients  is  warranted. 

KEYWORDS:  Field  cancerization.  Chronic  obstructive  pulmonary 
disease,  Non-small-cell  lung  cancer,  g-protein  coupled  receptor 
family  C,  group  5,  member  A,  gene  expression. 
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Lung  cancer,  the  majority  of  which  is  non-small  cell  lung 
cancer  (NSCLC),  is  the  leading  cause  of  deaths  in  the 
United  States  and  worldwide.1  The  high  mortality  associ¬ 
ated  with  lung  cancer  is  in  part  because  of  late  diagnosis  after 
regional  or  distant  spread  of  the  disease.2  Improved  clinical 
management  of  NSCLC  is  tightly  linked  to  the  identification 
of  new  effective  early  biomarkers  that  can  spear  novel  strate¬ 
gies  for  early  detection,  prevention,  and  treatment.2,3 

The  majority  (85%)  of  diagnosed  NSCLC  cases  are 
attributable  to  cigarette  smoking.4,5  Auerbach  et  al. 6  earlier 
showed  that  tobacco  carcinogen  exposure  causes  multifocal 
and  clonal  histopathological  changes  in  the  airway  epithelia 
of  smokers,  suggestive  of  a  field  cancerization  in  the  lung.  In 
addition,  cigarette  smoking  perpetuates  inflammation  through¬ 
out  the  smoking-exposed  airway  epithelia  of  heavy  smokers,7 
which  was  suggested,  to  lead  to  onset  of  lung  cancer  develop¬ 
ment.8  Chronic  obstructive  pulmonary  disease  (COPD)  of  the 
lung  is  an  inflammatory  condition  that  is,  like  lung  cancer, 
causally  linked  to  cigarette  smoking9,10  and  is  a  major  cause 
of  mortality  in  the  United  States.11  Moreover,  preinvasive  lung 
cancer  lesions  are  common  (approximately  50%)  in  airways 
of  COPD  patients.12  Importantly,  although  phenotypically 
healthy  smokers  comprise  a  significant  population  at  risk  for 
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lung  cancer,57  COPD  increases  lung  cancer  risk  by  4.5-fold, 
more  than  either  age  or  quantity  of  smoking. 11,13,14  In  addi¬ 
tion,  33%  of  patients  with  mild-to-moderate  COPD  eventually 
die  because  of  lung  cancer  burden.11  Although  smoking  is  the 
main  cause  of  both  diseases,  only  10%  to  15%  of  smokers 
develop  COPD  and/or  lung  cancer,  suggesting  that  other  fac¬ 
tors,  including  genetic  variation  and  expression,  may  differ 
across  individuals  in  response  to  cigarette  smoke.4,5,9,10  It  is 
plausible  to  assume  that  understanding  common  expression 
patterns  between  both  diseases  for  subsequent  identification 
of  biological  markers  that  explain  risk  of  disease  onset  and/or 
progression  will  favorably  promote  the  clinical  management 
of  COPD-associated  lung  cancer.11 

Our  group  has  previously  demonstrated  that  loss  of 
G-protein  coupled  receptor,  family  C,  group  5,  member  A 
( m GprcSa/h GPR C5A ),  exemplified  by  the  G/vxOa-knockout 
mouse  model,  leads  to  chronic  inflammation  and  spontaneous 
lung  tumor  development.15,16  However,  the  expression  of  this 
tumor  suppressor  in  human  lung  NSCLCs,  including  those 
associated  with  inflammatory  conditions  such  as  COPD,  is 
unknown.  We  investigated  the  immunohistochemical  expres¬ 
sion  of  GPRC5A  in  a  large  series  of  NSCLC  histological  tis¬ 
sue  specimens  and  in  normal  bronchial  epithelia  (NBE)  from 
lung-disease-free  individuals,  COPD  patients  and  from  COPD 
patients  with  lung  cancer  and  its  transcript  expression  in  inde¬ 
pendent  resected  normal  epithelia  and  corresponding  tumors. 
Our  findings  reveal  marked  reduced  GPRC5A  expression  in 
human  NSCLC  and  in  the  histologically  normal  field  can- 
cerization  associated  with  COPD  and  lung  cancer.  Our  study 
pinpoints  to  a  potential  role  of  this  tumor  suppressor  in  the  pro¬ 
gression  of  COPD-associated  NSCLC,  which  warrants  further 
studies  to  assess  its  use  as  a  risk  marker  for  this  disease. 

MATERIALS  AND  METHODS 

NSCLC  Tissue  mMcroarrays  and  Normal 
Bronchial  Epithelial  Specimens 

Tissue  microarrays  (TMAs)  used  in  this  study  com¬ 
prised  474  surgically  resected  NSCLC  tumor  specimens  (308 
adenocarcinomas,  166  squamous  cell  carcinomas  [SCCs]) 
collected  under  an  Institutional  Review  Board  protocol  and 
archived  as  formalin-fixed  paraffin-embedded  (FFPE)  speci¬ 
mens  in  the  University  of  Texas  Specialized  Program  of 
Research  Excellence  thoracic  tissue  bank  at  the  University 
of  Texas  M.D.  Anderson  Cancer  Center.  Clinicopathological 
features  of  the  NSCLC  patients  examined  are  summarized  in 
Table  1.  TMAs  were  prepared  with  a  manual  tissue  arrayer 
(Advanced  Tissue  Arrayer  ATA100,  Chemicon  International, 
Temecula,  CA)  using  1-mm  diameter  cores  in  triplicate  for 
tumors,  as  described  previously.17  Histological  sections  4  pm 
in  thickness  were  then  prepared  for  subsequent  immunohisto- 
chemistry  analysis.  FFPE  specimens  of  NBE  from  50  patients 
with  COPD  with  forced  expiratory  volume  in  1  second/forced 
vital  capacity  ratio  of  70%  or  less,  a  smoking  history  of  30  or 
more  pack-years,  and  collected  at  Baylor  college  of  Medicine 
(Houston,  TX)  were  included  in  the  study,  including  24  can¬ 
cer-free  cases  and  26  cases  with  NSCLC.  In  addition,  NBE 
specimens  from  never  smokers  ( n  =13)  and  ever  smokers 


TABLE  1.  Clinicopathological  Features  of  NSCLC  Patients  in 


Tissue  Microarray  Sets  Used  in  the  Study 

Covariate 

Levels 

«  (%) 

Sex 

Female 

240  (50.6) 

Male 

234  (49.4) 

Histology 

Adenocarcinoma 

308  (65.0) 

Squamous  cell  carcinoma 

166  (35.0) 

Stage 

i 

302  (63.7) 

ii 

86(18.1) 

III  or  IV 

86(18.1) 

Grade* 

Well 

36(10.4) 

Moderate 

193  (55.9) 

Poor 

116(33.6) 

Smoking  status 

Current 

178  (37.6) 

Former 

229  (48.3) 

Never 

67(14.1) 

Tobacco  history 

No 

67(14.1) 

Yes 

407  (85.9) 

*Information  on  differentiation  grade  was  available  in  a  subset  of  NSCLCs  analyzed. 
NSCLC,  non-small-cell  lung  cancer. 


(n  =  18)  without  lung  cancer  and  with  no  or  mild  airway 
obstruction  status  were  analyzed. 

Immunohistochemical  Analysis 

Polyclonal  antirabbit  antibodies  raised  against  human 
GPRC5A  were  generated  by  peptide  synthesis  and  site- 
directed  carrier  conjugation  using  keyhole  limpet  hemocyanin 
as  a  custom  service  by  Zymed  Laboratories  Inc.  (South 
San  Francisco,  CA).  The  synthetic  peptide  (cysteine)- 
PSPYKDYEVKKEGS-COOH,  corresponding  to  amino 
acids  344-357  in  the  human  GPRC5A  C-terminus,  was 
covalently  linked  to  keyhole  limpet  hemocyanin  via  an  added 
cysteine  residue,  and  polyclonal  antipeptide  antibodies  were 
prepared  in  rabbits.  Sera  were  confirmed  to  contain  high-titer 
antibodies  against  the  specific  peptide  using  enzyme-linked 
immunosorbent  assays.  TMAs  and  the  histologic  sections  (4 
pm  thick)  of  surgical  resected  specimens  were  deparaffinized 
and  hydrated,  and  antigen  retrieval  was  performed  using  a 
decloaker  with  Dako  target  retrieval  system  at  a  pH  of  6.0  (Dako 
North  America,  Inc.,  Carpinteria,  CA).  Intrinsic  peroxidase 
activity  was  blocked  by  3%  methanol  and  hydrogen  peroxide 
for  12  minutes  and  serum-free  protein  block  (Dako)  was  used 
for  30  minutes  for  blocking  nonspecific  antibody  binding. 
Slides  were  then  incubated  with  the  antibodies  against  human 
GPRC5  A  ( 1 :200  dilution)  at  4°C  overnight.  After  three  washes 
in  Tris-buffered  saline,  slides  were  incubated  for  30  minutes 
with  Dako  Envision  +  Dual  Link  at  room  temperature.  After 
three  additional  washes,  slides  were  incubated  with  Dako 
chromogen  substrate  for  5  minutes  and  were  counterstained 
with  hemotoxylin  for  another  5  minutes.  FFPE  whole-section 
specimens,  except  for  the  omission  of  the  primary  antibodies, 
were  used  as  negative  controls.  The  intensity  and  extent  of 
cytoplasmic  and  nuclear  GPRC5A  immunostaining  were 
evaluated  using  a  light  microscope  (magnification,  x  20) 


2 


Copyright  ©  2012  by  the  International  Association  for  the  Study  of  Lung  Cancer 


09/25/12 


Anjana 


4  color  fig(s);  F#  03:30  Art:  JT0202230 


journal  of  Thoracic  Oncology  •  Volume  XX,  Number  XX,  XX  201 2  CPRC5A  in  COPD-Linked  NSCLC  Pathogenesis 


independently  by  two  pathologists  (JF  and  IIW).  GPRC5A 
immunoreactivity  was  mainly  cytoplasmic,  which  was 
quantified  using  a  four- value  intensity  score  (0,  none;  1,  weak; 
2,  moderate;  and  3,  strong)  and  the  percentage  (0-100%)  of 
the  extent  of  reactivity.  A  final  cytoplasmic  expression  score 
was  obtained  by  multiplying  the  intensity  and  reactivity 
extension  values  (range,  0-300). 

CPRC5A  Transcript  Expression  in  Resected 
NSCLC  Specimens  and  Adjacent  Airway 
Epithelia 

GPRC5A  mRNA  expression  was  assessed  in  ever-smoker 
NSCLCs  and  their  uninvolved  normal  lung  parenchyma  tis¬ 
sues  as  well  as  in  NBE  collected  by  brushing  of  multiple 
corresponding  airways  from  the  same  lobectomy-  and  pneu¬ 
monectomy-resected  specimens.  Tumor  tissues  from  six  ever- 
smoker  NSCLCs  and  normal  lung  specimens  (n  =  6)  as  well  as 
NBE  by  brushings  (n  =  29 )  from  the  same  patients  were  obtained 
under  an  Institutional  Review  Board  approved  protocol  in  which 
specimens  were  collected  from  patients  who  had  signed  letters 
of  consent.  NBE  were  collected  from  multiple  adjacent  to  tumor 
airways  by  brushing  each  site  independently  using  Cytosoff 
cytology  brushes  (Cardinal  Health,  Dublin,  OH).  Parallel 
brushes  were  used  for  touch-prep  for  cytological  assessment  by 
pan-cytokeratin  staining,  which  revealed  epithelial  content  more 
than  90%.  Normal  histology  was  determined  by  hematoxylin 
and  eosin  staining.  Brushes  were  immediately  placed  in  Qiazol 
lysis  reagent  (Qiagen,  Valencia,  CA)  and  stored  in  -80°C  until 
further  processing.  NSCLC  and  normal  lung  specimens  were 
shaved  for  histological  assessment  of  percentage  of  tumor  con¬ 
tent  and  malignant  cells  and  for  corresponding  RNA  isolation. 
Total  RNA  from  all  samples  was  purified  using  the  miRNeasy 
kit  (Qiagen)  according  to  the  manufacturer’s  instructions.  RNA 
quantity  was  determined  using  the  NanoDrop  spectrophotome¬ 
ter,  (Thermo  Scientific,  Wilmington,  DE),  and  quality  was  deter¬ 
mined  by  analysis  of  RNA  integrity  with  Agilent  Bioanalyzer 
2000  (Agilent  Technologies,  Santa  Clara,  CA). 

Quantitative-Real  Time  Polymerase 
Chain  Reaction 

Total  RNA  (150ng)  was  reverse-transcribed  using 
the  high-capacity  RNA-to-cDNA  kit  (Life  Technologies, 
Carlsbad,  CA)  according  to  the  manufacturer’s  instructions. 
Quantitative  real-time  polymerase  chain  reaction  (PCR)  was 
performed  using  TaqMan®  (Applied  Biosystems,  Foster  City, 
CA)  gene  expression  assays  for  GPRC5A  (Hs00173681_ml) 
and  beta-actin  ( ACTB )  (Hs99999903_ml)  primers  (Life 
Technologies)  on  a  7900HT  Fast-Real-Time  PCR  System 
(Life  Technologies)  according  to  the  manufacturer’s  instruc¬ 
tions.  All  samples  were  done  in  triplicates  and  normalized  to 
ACTB.  Relative  quantification  was  calculated  using  the  com¬ 
parative  cycle  threshold  method  as  previously  described.18 

Statistical  Analysis 

Summary  statistics,  including  frequency  tabulation, 
means,  standard  deviations,  median,  and  range,  were  given 
to  describe  patient  characteristics.  Wilcoxon  rank  sum  test 


or  Kruskal-Wallis  test  was  used  to  compare  GPRC5A 
immunohistochemical  expression  between  two  levels  or  among 
more  than  two  levels,  when  appropriate.  General  linear  model 
was  applied  to  test  the  difference  of  GPRC5  A  expression  across 
NBE  from  COPD-  and  cancer-free  never  and  ever  smokers, 
cancer-free  COPD  patients,  as  well  as  NBE  from  patients  with 
both  COPD  and  cancer.  Repeated  measures  analysis  was  used  to 
determine  significance  of  GPRC5A  transcript  variation  across 
matched  NSCLCs,  airways,  and  uninvolved  normal  lung.  All 
statistical  tests  were  two-sided,  and p  values  of  0.05  or  less  were 
considered  to  be  statistically  significant.  Statistical  analysis  was 
performed  with  standard  statistical  software,  including  SAS 
Release  9.1.3  for  Windows  and  S-Plus  8.0  for  Windows. 


RESULTS 

GPRC5A  Immunohistochemical  Expression  in 
Human  NSCLC 

We  have  previously  demonstrated  that  m Gprc5at 
h GPRC5A  is  a  lung-specific  tumor  suppressor  evidenced 
by  spontaneous  tumor  development  in  mice  with  knockout 
of  this  gene  as  well  as  loss  of  the  transformed  phenotype  in 
malignant  NSCLC  cell  lines  after  GPRC5A  overexpression.16 
We  sought  to  examine  the  immunohistochemical  expression 
patterns  of  GPRC5A  tumor  suppressor  in  NSCLC,  which  is 
unknown,  and  its  association  with  clinicopathological  fea¬ 
tures  of  the  disease,  including  the  two  major  histologic  types 
examined,  adenocarcinoma  and  SCC.  The  clinicopatho¬ 
logical  characteristics  of  the  NSCLC  patients  from  which 
FFPE  tumor  specimens  were  analyzed  are  detailed  in  Table 
1.  Representative  photomicrographs  of  GPRC5A  immuno¬ 
histochemical  expression  in  human  lung  adenocarcinoma 
(left)  and  SCC  (right)  are  depicted  in  Figure  1  A.  GPRC5A 
immunoreactivity  was  mainly  cytoplasmic.  GPRC5A 
expression  was  significantly  lower  in  the  adenocarcinomas 
(n  =  308;  mean,  48.91  ±36;  median,  41.67;  minimum,  0; 
maximum,  180)  and  SCCs  («  =  166;  mean,  23.47±22.79; 
median,  18.33;  minimum,  0;  maximum,  110)  examined  com¬ 
pared  with  the  uninvolved  normal  bronchial  epithelia  (mean, 
164.52±34.04;  median,  160;  minimum,  100;  maximum, 
230)  from  lung-disease-free  smokers  and  never  smokers 
(p  <  0.0001)  (Fig.  IB).  Moreover,  GPRC5A  immunohisto¬ 
chemical  expression  was  significantly  lower  in  SCCs  com¬ 
pared  with  that  in  adenocarcinomas  {p  <  0.0001)  (Fig.  IB). 
We  then  correlated  GPRC5A  with  other  clinicopathological 
features  for  all  NSCLCs.  Notably,  GPRC5A  expression  was 
significantly  increased  in  NSCLC  tumor  specimens  from 
never  smokers  (n  =  67)  (mean,  53. 94 ±39. 94;  median,  40; 
minimum,  0;  maximum,  163.33)  compared  with  ever  smokers 
( n  =  407)  (mean,  37.7 1  ±  32.65;  median,  30;  minimum,  0;  max¬ 
imum,  180)  (p  =  0.001)  (Fig.  IQ.  GPRC5A  expression  was 
also  significantly  different  among  never,  former  and  current 
smoker  NSCLCs  (p  =  0.003)  (Fig.  ID).  In  addition,  GPRC5A 
expression  was  significantly  positively  associated  with  well- 
differentiated  tumor  grade  in  a  subset  of  NSCLC  tumors 
( n  =  346)  examined  with  available  grade  information  ( p  =  0.004) 
(Fig.  ID).  It  is  important  to  note  that  when  we  examined  each 
histology  type  separately,  there  were  no  statistically  significant 
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FIGURE  1.  GPRC5A  immunohistochemical  expression  in  NSCLC  histological  tissue  specimens.  A,  Representative  photomi¬ 
crographs  of  GPRC5A  immunohistochemical  expression  in  lung  adenocarcinoma  (left)  and  SCC  (right)  observed  under  a  light 
microscope  at  a  magnification  of  20  x.  Bars,  100  pm.  B,  Box-plots  depicting  differences  in  GPRC5A  immunohistochemical 
expression  among  NBE,  Adeno,  and  SCCs.  C,  Box-plots  depicting  GPRC5A  expression  based  on  tobacco  history  (never  ver¬ 
sus  ever  smokers),  (D),  smoking  status  (never,  former  and  current),  and  (£),  differentiation  grade  (well,  moderate,  and  poor), 
p  values  based  on  the  Wilcoxon  rank  sum  test  and  Kruskal-Wallis  test.  GPRC5A,  G-protein  coupled  receptor  family  C,  group 
5,  member  A;  NSCLC,  non-small-cell  lung  cancer;  SCC,  squamous  cell  carcinomas;  NBE,  normal  bronchial  epithelia;  Adeno, 
adenocarcinomas. 


correlations  between  GPRC5A  expression  and  patient’s  smok¬ 
ing  history  and  status,  and  tumor-differentiation  grade. 

GPRC5A  Immunohistochemical  Expression 
in  NBE  from  COPD  Patients 

We  have  previously  shown  that  loss  of  m Gprc5al 
h GPRC5A  tumor  suppressor  leads  to  chronic-inflammation- 
mediated  prosurvival  signaling  and  transformation  of 
nonmalignant  lung  epithelial  cells.15  The  relevance  of 
GPRC5A  expression  to  human  NSCLC  pathogenesis, 
including  that  associated  with  COPD,  a  chronic  inflammatory 
condition  and  risk  factor  for  lung  cancer,  remains  elusive.  In 
light  of  the  strong  association  between  inflammation,  COPD 
and  NSCLC,10,11  we  sought  to  examine  the  expression  of 
GPRC5A  NBE  in  COPD-associated  adenocarcinoma  and 
SCC.  GPRC5A  expression  was  assessed  in  histological  tissue 
specimens  of  NSCLC  and  NBE,  including  those  from  lung 
cancer-free  COPD  and  from  NSCLC  patients.  Representative 
photomicrographs  of  GPRC5A  immunohistochemical 
staining  in  the  different  NBE  are  depicted  in  Figure  2 A. 
GPRC5A  immunoreactivity  seemed  to  be  highest  in  NBE 
from  lung-disease-free  never  and  ever  smokers,  successively 
significantly  decreased  in  NBE  of  COPD  patients  who  are 
cancer-free,  and  lowest  in  NBE  of  patients  with  both  COPD 
and  adenocarcinoma  or  SCC  (Fig.  25).  A  general  linear  model 


demonstrated  a  significant  gradual  decrease  of  GPRC5A 
expression  from  NBE  of  disease-free  never  smokers  to 
NBE  from  patients  with  COPD  and  adenocarcinoma  or 
SCC  (p  <  0.0001)  (Fig.  2 5).  The  mean  (177.31  ±30.66)  and 
median  (195;  minimum,  130;  maximum,  220)  of  GPRC5A 
expression  in  NBE  of  lung-disease-free  never  smokers 
were  higher  than  that  of  smokers  (mean,  155.28±34.15, 
median,  147.5;  minimum,  100;  maximum,  230)  although 
the  differences  were  not  statistically  significant  (Fig.  2 B). 
Moreover,  GPRC5A  expression  was  significantly  higher 
in  NBE  of  lung-disease-free  smokers  relative  to  NBE  in 
cancer-free  COPD  patients  (mean,  85.48±53.55,  median, 
72.88;  minimum,  20;  maximum,  211)  (p  =  0.007),  which  in 
turn  was  significantly  higher  compared  with  the  expression 
in  NBE  from  COPD  patients  with  adenocarcinoma  (mean, 
48.15±52.84,  median,  35;  minimum,  0;  maximum,  170) 
( p  <  0.0001)  or  SCC  (mean,  32.24±34.9,  median,  12.5; 
minimum,  0;  maximum,  101.69)  (p  <  0.0001)  (Fig.  2 5). 
Furthermore,  although  GPRC5A  expression  was  lower  in 
NBE  of  COPD  patients  with  SCC  compared  with  those 
with  COPD  and  adenocarcinoma,  the  difference  was  not 
statistically  significant  (Fig.  25).  These  findings  suggest  that 
reduced  expression  of  the  GPRC5A  tumor  suppressor  may 
be  implicated  in  the  pathogenesis  of  NSCLC  associated  with 
inflammatory  COPD. 
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FIGURE  2.  Reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD-associated  NSCLC.  A,  Representative 
photomicrographs  of  GPRC5A  immunohistochemical  expression  in  NBE  from  disease-free  never  and  ever  smokers  (upper 
left  and  upper  right,  respectively)  and  in  NBE  from  cancer-free  COPD  patients  (bottom  left)  and  NSCLC  patients  (bot¬ 
tom  right)  with  COPD  observed  under  a  light  microscope  at  a  magnification  of  x  20.  Bars,  100  pm.  B,  Box-plots  depicting 
quantification  of  GPRC5A  immunohistochemical  expression  represented  in  (A).  Main  p  value  signifies  statistical  significance 
of  reduced  CPRC5A  expression  in  the  sequential  pathogenesis  of  COPD-associated  NSCLC  and  among  the  five  groups  by 
the  general  linear  model  and  Kruskal-Wailis  test,  p  values  signifying  the  indicated  pair  wise  comparisons  among  the  groups 
were  obtained  by  the  Wilcoxon  rank  sum  test.  GPRC5A,  G-protein  coupled  receptor  family  C,  group  5,  member  A;  NBE, 
normal  bronchial  epithelia;  NSCLC,  non-small-cell  lung  cancer;  COPD,  chronic  obstructive  pulmonary  disease;  SCC,  squa¬ 
mous  cell  carcinomas;  NS,  not  significant. 


We  then  examined  GPRC5A  expression  in  epithelial 
cells  from  different  compartments  of  the  lung  (bronchial, 
bronchiolar  and  alveolar  epithelia  and  cells).  GPRC5A 
expression  was  statistically  significantly  lower  in  bronchi 
and  bronchioles  from  cancer-free  COPD  patients  (p  <  0.005) 
and  patients  with  both  COPD  and  SCCs  or  adenocarcinomas 
( p  <  0.0001)  compared  with  lung-disease-free  never  smokers 
(Supplementary  Table  1,  Supplemental  Digital  Content  1, 
http://links.lww.com/JTO/A334).  In  contrast,  GPRC5A 
expression  in  alveolar  compartment  remained  high  (score 
average  172.2-261.5)  in  all  groups  of  cases  examined,  and 
there  were  no  statistically  significant  differences  based  on 


COPD  status  (Supplementary  Table  1,  Supplemental  Digital 
Content  1,  http://links.lww.com/JTO/A334). 

CPRC5A  Expression  in  Smoker  NSCLC  Field 
Cancerization 

Our  previous  findings  on  GPRC5A  tumor-suppressive 
function151619  as  well  as  our  current  observation  on  significant 
and  progressive  reduced  expression  of  this  tumor  suppressor  in 
NBE  from  lung  cancer-free  COPD  patients  and  patients  with 
NSCLC,  prompted  us  to  probe  whether  GPRC5A  expression 
exhibits  a  field  cancerization  pattern  in  NSCLC.  We  sought 
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to  assess  GPRC5A  transcript  expression  in  multiple  bronchial 
sites  ( n  =  29)  as  well  as  in  the  tumors  («  =  6)  and  uninvolved 
normal  lung  parenchyma  (n  =  6)  in  six  NSCLC  cases  (3  SCCs 
and  3  adenocarcinomas).  Quantitative  real-time  PCR  analysis 
demonstrated  progressively  reduced  GPRC5A  expression  from 
normal  uninvolved  lung  parenchyma  (relative  quantification 
[RQ]  mean,  4.18±0.62;  median,  4.16;  minimum,  3.51;  maxi¬ 
mum,  5.04)  to  histologically  normal  and  adjacent  bronchial  epi- 
thelia  (RQ  mean,  3. 12 ±1.94;  median,  2.86,;  minimum,  0.59; 
maximum,  10.12)  to  NSCLC  tumors  (RQ  mean,  1.35±0.71; 
median  1.31;  minimum  0.24;  maximum  2.30),  which  exhibited 
on  an  average  the  lowest  expression  of  the  gene  (Fig.  3 A  and  B) 
( p  =  0.03).  These  findings  suggest  that  GPRC5A  expression  is 
reduced  in  the  smoking-injured  field  cancerization  in  NSCLC 
highlighting  a  potentially  strong  implication  of  this  tumor  sup¬ 
pressor  in  the  pathogenesis  of  this  malignancy. 

DISCUSSION 

In  this  study,  we  assessed  the  expression  of  GPRC5A, 
which  we  had  found  to  exhibit  lung-specific  tumor-suppres¬ 
sor  properties  in  mice,15,1619  in  human  NSCLC  histological 
tissue  specimens,  as  well  as  in  NBE  from  COPD  and  lung 
adenocarcinoma  and  SCC  patients.  We  found  that  the  protein 
product  of  human  GPRC5A  was  lower  in  lung  tumors  com¬ 
pared  with  uninvolved  histologically  NBE,  and  was  signifi¬ 
cantly  associated  with  adenocarcinoma  histology.  GPRC5A 
airway  expression  seemed  to  exhibit  a  progressive  decrease  in 
the  field  cancerization  of  COPD-related  NSCLC  with  highest 
expression  in  bronchial  epithelia  obtained  from  disease-free 
individuals,  intermediate  immunoreactivity  in  normal  epithe¬ 
lia  from  cancer-free  COPD  patients,  and  lowest  in  normal  epi¬ 
thelia  from  patients  with  both  COPD  and  adenocarcinoma  and 
SCC  histologies.  Furthermore,  GPRC5A  transcript  expression 
was  also  lower  in  histologically  NBE  as  well  as  in  adjacent 
and  corresponding  smoker  NSCLC  tumors,  irrespective  of 


COPD  status,  compared  with  uninvolved  normal  lung  sugges¬ 
tive  of  a  field  cancerization-mediated  expression  modulation 
of  this  G-protein  coupled  receptor.  In  light  of  the  relevance 
of  the  field  cancerization  phenomenon  to  NSCLC  pathogen¬ 
esis, 5“7’20,21  our  findings  pinpoint  to  a  potential  tumor-suppres¬ 
sive  role,  similar  to  that  established  in  mice,  of  GPRC5A  in 
the  sequential  development  of  human  NSCLC,  in  particular 
those  associated  with  inflammatory  chronic  obstructive  dis¬ 
ease.  NSCLC  and  COPD  are  both  mainly  attributable  to  ciga¬ 
rette  smoke910,13  and  are  leading  causes  of  deaths  in  the  United 
States  and  worldwide. 1,2,11  Given  that  COPD  is  a  major  risk 
factor  for  lung  cancer  and  shares  various  pathogenic  features 
with  lung  tumors,9,10,22-25  better  molecular  markers  are  needed 
to  identify  which  COPD  patients  will  continue  to  develop  lung 
malignancies.11  Thus,  the  results  herein  raise  the  intriguing 
possibility  that  GPRC5A  loss  may  be  a  useful  biomarker  in 
assessing  the  risk  of  NSCLC  development  in  COPD  patients. 

COPD  and  infections  as  well  as  inflammatory  disorders 
of  the  respiratory  tract  may  be  linked  to  an  increased 
risk  of  lung  cancer.26,27  We  have  previously  demonstrated 
significantly  increased  macrophage  infiltration  into  lungs  of 
Gprc5a-knockout  mice  along  with  their  direct  association 
with  adenocarcinomas,  which  was  accompanied  by  higher 
constitutive  levels  of  proinflammatory  cytokines  and 
chemokines  and  increased  susceptibility  to  stimulation  of 
nuclear  factor-kappa  B  (NF-kB)  activation  in  vivo.15  We  also 
showed  that  loss  of  Gprc5a- mediated  activation  of  NF-kB 
was  causally  linked  to  macrophage  recruitment  and  enhanced 
inflammation  toward  the  creation  of  a  tumor-promoting 
microenvironment.15  It  is  noteworthy  that  lungs  of  patients 
with  severe  COPD  exhibit  macrophage/CD8+  T  cell  and 
neutrophil  infiltration  based  on  histopathological  studies  and 
bronchial-biopsies.28  Moreover,  it  is  worthwhile  to  mention 
that  we  attempted  to  statistically  assess  the  association 
between  expression  of  GPRC5A  and  that  of  NF-kB  (nuclear 
p65  immunoreactivity)  in  NSCLCs  and  found  a  significant 
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FIGURE  3.  Decreased  CPRCSA  mRNA  expression 
in  histologically  NBE  and  corresponding  NSCLC 
tumors  from  resected  specimens  compared  with 
matched  uninvolved  normal  lung.  Total  RNA  was 
isolated  from  brushings  of  NBE  and  frozen  sec¬ 
tions  of  matched  NSCLC  (T)  and  uninvolved  NL 
parenchyma  obtained  from  resected  lobectomy  or 
pneumonectomy  specimens  performed  on  smoker 
early-stage  patients.  A,  CPRC5A  expression  was 
analyzed  in  all  samples  (in  triplicates)  by  QRTPCR 
and  normalized  against  that  of  ACTB  to  yield  a 
relative  expression  by  the  2-aACT  method  depicted 
in  the  graphs.  NSCLC  tumors  (squamous  cell 
carcinoma,  cases  1-3,  and  adenocarcinomas,  cases 
4-6),  uninvolved  NL  parenchyma  and  bronchial 
structures  are  labeled  by  the  indicated  colors.  B, 
CPRC5A  transcript  expression  among  uninvolved 
NL  tissue,  NBE,  and  NSCLC  tumors  was  assessed 
statistically  by  repeated  measures  analysis.  NBE, 
normal  bronchial  epithelia;  NSCLC,  non-small-cell 
lung  cancer;  NL,  normal  lung;  CT,  comparative 
cycle  threshold  method.;  QRTPCR,  quantitative 
real-time  polymerase  chain  reaction. 


6 


Copyright  ©  2012  by  the  International  Association  for  the  Study  of  Lung  Cancer 


09/25/12 


Anjana 


4  color  fig(s);  F#  03:30  Art:  JT0202230 


journal  of  Thoracic  Oncology  •  Volume  XX,  Number  XX,  XX  201 2  CPRC5A  in  COPD-Linked  NSCLC  Pathogenesis 


inverse  correlation  between  both  proteins  in  particular 
in  adenocarcinomas  (data  not  shown).  In  addition,  it  was 
recently  shown  that  the  incidence  of  hyperplastic  lesions  in 
lungs  of  G/>rc5a-knockout  mice  was  increased  after  exposure 
to  nontypeable  Haemophilus  influenzae.29  Notably,  bacterial 
colonization,  particularly  with  nontypeable  Haemophilus 
influenzae,  has  been  implicated  as  a  cause  of  airway 
inflammation  in  COPD  besides  cigarette  smoke.30  In  this 
context,  the  commonalities  between  inflammatory-mediated 
and  histopathological  mechanisms  in  G/?rc5a-knockout  mice 
and  those  evident  in  lungs  of  COPD  patients  along  with  our 
current  findings  on  reduced  GPRC5A  expression  in  normal 
cells  from  COPD  patients  pinpoint  to  a  tumor-suppressive  role 
of  this  gene  in  COPD-associated  human  lung  tumorigenesis. 

We  previously  cloned  the  retinoid-regulated  m Gprc5a! 
h GPRC5A  and  found  that  it  was  preferentially  expressed  in  fetal 
and  adult  mouse  and  human  lung  tissue  compared  with  nor¬ 
mal  specimens  from  other  anatomically  distinct  organs.31  The 
functional  relevance  of  this  lung-specific  expression  was  high¬ 
lighted  by  our  earlier  study  demonstrating  spontaneous  devel¬ 
opment  of  adenomas  and  adenocarcinomas  in  Gprc5a-nockout 
mice,  which  was  not  observed  in  wild-type  littermates.  In  addi¬ 
tion,  spontaneous  tumor  development  in  G/>rc5a-knockout 
mice  was  characterized  by  late  tumor  onset  (12-1 6-month-old 
mice)  and  low  multiplicity.16  Notably,  we  reported  that  expo¬ 
sure  to  the  tobacco-specific  carcinogen  4-(methylnitrosamino)- 
l-(3-pyridyl)-l-butanone  (nicotine-derived  nitrosamine  ketone) 
augmented  tumorigenesis  evidenced  by  6-month  earlier  devel¬ 
opment  of  lesions,  two-  to  fourfold  increased  tumor  incidence 
and  multiplicity,  as  well  as  a  dramatic  increase  in  lesion  size.19 
It  is  worthy  to  note  that,  in  this  study,  we  found  significantly 
reduced  expression  of  GPRC5A  in  human  smoking-injured 
histologically  NBE  adjacent  to  lung  tumors  and  in  the  local¬ 
ized  field  cancerization  of  smoker  NSCLC  patients,  compared 
with  uninvolved  normal  lung.  Moreover,  reduced  GPRC5A 
expression  was  concomitant  among  airways  and  correspond¬ 
ing  NSCLC  compared  with  matched  normal  lung  tissue.  It  is 
plausible  to  assume  that  smoking-mediated  reduced  airway 
expression  of  GPRC5A  in  patients  may  explain  our  previous 
observations  on  augmented  tumorigenesis  in  GprcJa-knockout 
mice  after  exposure  to  tobacco  carcinogens.  Because  COPD  is 
causally  linked  to  smoking,9"1 1,22,23  it  cannot  be  neglected  that 
reduced  GPRC5A  expression  in  NBE  from  COPD  patients 
may  be  because  of  the  molecular  impact  of  smoking  on  the 
airway  transcriptome.  However,  it  is  important  to  mention 
that  we  found  significantly  decreased  GPRC5A  expression  in 
normal  epithelia  from  COPD  patients  compared  with  epithe- 
lia  obtained  from  disease-free  smokers.  Moreover,  GPRC5A 
expression  was  further  decreased  and  lowest  in  normal  airway 
epithelia  from  COPD  patients  with  NSCLC  compared  with 
those  from  cancer-free  COPD  patients.  Thus  in  this  context, 
our  findings  raise  the  possibility  that  GPRC5A  expression  may 
be  impacted  by  cigarette  smoking  and  may  be  both  a  cause  and 
consequence  of  increased  inflammation  in  tumor-promoting 
microenvironment  in  the  lung  as  well  as  NSCLC  development, 
which  may  be  better  delineated  in  future  warranted  studies. 

Our  group  has  previously  shown  that  human  lung  and 
head  and  neck  cancer  cell  lines  exhibit  reduced  GPRC5A 
expression  compared  with  their  normal  counterparts.31  It  is 


worthwhile  to  mention  that  treatment  of  cancer  cells  with 
all-trans  retinoic  acid  restored  GPRC5A  expression  in  cells 
devoid  of  the  tumor  suppressor  in  part  by  a  retinoic  acid 
receptor-dependent  manner.31,32  In  addition  and  in  a  sepa¬ 
rate  study,  we  have  previously  noted  reduced  retinoic  acid 
receptor  beta  (RAR|3)  in  approximately  50%  of  adenocar¬ 
cinomas  and  SCCs  as  well  as  loss  of  RARv  and  retinoid  X 
receptorf  )  in  a  significant  fraction  of  the  tumors.33  Moreover, 
RAR|3  expression  is  decreased  in  bronchial  epithelia  and 
preneoplastic  lesions  representing  the  sequence  of  lung  SCC 
pathogenesis.34  More  recently,  we  demonstrated,  using  gene 
expression  profiling,  that  loss  of  murine  Gprc5a  in  lung  epi¬ 
thelial  cells  reduced  expression  of  markers  of  squamous  dif¬ 
ferentiation  concomitantly  with  an  increase  in  mediators  of 
the  inflammatory  process  (e.g.,  NF-kB  signaling).  In  light 
of  the  biological  connection  between  retinoid  signaling  and 
GPRC5A  expression  as  well  as  our  current  findings  in  this 
revised  article  on  the  significant  reduced  expression  of  the 
tumor  suppressor  in  NBE  of  COPD  patients  including  those 
with  NSCLC,  it  is  plausible  to  suggest  that  retinoid  signal¬ 
ing  may  be  aberrantly  regulated  in  pathogenesis  of  COPD- 
associated  NSCLC.  It  is  important  to  note  that  retinoids  are 
currently  being  tested  for  their  lung  regenerative  properties 
and  effects  in  patients  with  COPD.35 

In  conclusion,  we  report  herein  decreased  expression  of 
GPRC5A  in  human  NSCLC  tissue  compared  with  uninvolved 
NBE  and  in  the  field  cancerization  of  smoker  NSCLCs  pin¬ 
pointing  to  a  tumor-suppressor  role  of  this  G-protein  coupled 
receptor  in  the  pathogenesis  of  this  leading  cause  of  cancer- 
related  deaths.  Moreover,  our  findings  highlight  a  progressive 
decrease  in  GPRC5A  expression  in  the  sequential  pathogen¬ 
esis  of  NSCLCs  arising  in  COPD  patients,  warranting  future 
studies,  including  the  analysis  of  lung  cancer  preneoplastic 
lesions,  to  assess  the  potential  of  the  utility  of  this  gene  as  a 
biomarker  for  lung  cancer  risk  in  COPD  patients. 
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Abstract 


Gene  expression  alterations  in  response  to  cigarette  smoke  have  been  characterized  in  normal¬ 
appearing  bronchial  epithelium  of  healthy  smokers  and  it  has  been  suggested  that  adjacent 
histologically  normal  tissue  display  tumor-associated  molecular  abnonnalities.  We  sought  to 
delineate  the  spatial  and  temporal  molecular  lung  field  of  injury  in  smoker  early  stage  non-small 
cell  lung  cancer  (NSCLC)  patients  (n=19)  who  were  accrued  into  a  surveillance  clinical  trial  for 
annual  follow-up  and  bronchoscopies  within  one  year  after  definitive  surgery.  Bronchial 
brushings  and  biopsies  were  obtained  from  six  different  sites  in  the  lung  at  the  time  of  inclusion 
in  the  study  and  at  12,  24  and  36  months  after  the  first  time  point.  Affymetrix  Human  Gene  1.0 
ST  arrays  were  used  for  whole-transcript  expression  profiling  of  airways  (n=391).  Microarray 
analysis  identified  gene  features  (n=1165)  that  were  non-uniform  by  site  and  differentially 
expressed  between  airways  adjacent  to  tumors  relative  to  more  distant  samples  as  well  as  those 
(n=1395)  that  were  significantly  altered  with  time  up  to  three  years.  In  addition,  gene-interaction 
networks  mediated  by  PI3K  and  ERK1/2  were  modulated  in  adjacent  compared  to  contralateral 
airways  and  the  latter  network  with  time.  Furthennore,  phosphorylated  AKT  and  ERK1/2 
immunohistochemical  expression  were  significantly  increased  with  time  (nuclear  pAKT,  p=0.03; 
cytoplasmic  pAKT,  p<0.0001;  pERKl/2,  p=0.02)  and  elevated  in  adjacent  compared  to  more 
distant  airways  (nuclear  pAKT,  p=0.04;  pERKl/2,  p=0.03).  This  study  highlights  spatial  and 
temporal  cancer-associated  expression  alterations  in  the  molecular  field  of  injury  of  early  stage 
NSCLC  patients  after  definitive  surgery  that  warrant  further  validation  in  independent  studies. 
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Introduction 


Lung  cancer,  of  which  non-small  cell  lung  cancer  (NSCLC)  comprises  the  majority,  is 
the  leading  cause  of  cancer-related  deaths  in  the  US  and  worldwide  (1,2).  The  high  mortality  of 
this  disease  is  in  part  due  to  the  late  diagnosis  of  the  majority  of  lung  cancers  after  regional  or 
distant  spread  of  the  malignancy  (3).  Recent  data  from  the  National  Lung  Screening  Trial  (4), 
indicating  that  screening  increases  early  detection  rates,  are  expected  to  augment  the  number  of 
early  stage  NSCLC  detected  warranting  the  need  for  better  clinical  management  of  this  growing 
subpopulation.  Besides  adjuvant  therapy,  there  are  no  effective  chemoprevention  strategies  for 
early  stage  NSCLC  patients  (5)  who  comprise  approximately  50%  of  all  diagnosed  cases  and 
have  a  relatively  high  rate  of  relapse  (3).  Improved  clinical  management  of  early  stage  NSCLC  is 
tightly  linked  to  the  identification  of  new  effective  early  biomarkers  that  can  guide  potential 
chemoprevention  strategies. 

Most  diagnosed  NSCLC  cases  (85%)  are  attributable  to  cigarette  smoking  (6-8). 
Auerbach  et  al  found  that  cigarette  smoke  induces  extensive  histological  changes  in  the 
bronchial  epithelia  in  the  lungs  of  smokers  and  that  premalignant  lesions  are  widespread  and 
multifocal  throughout  the  respiratory  epithelium,  suggestive  of  a  field  effect  (9).  Many  molecular 
abnormalities,  such  as  loss  of  heterozygosity  (LOH)  (10-12),  mutations  in  TP  5  3  (13), 
methylation  of  pi  6  tumor  suppressor,  death  associated  protein  kinase  (DAPK)  and  retinoic  acid 
receptor  2  beta  (RAR-J32)  were  detected  in  bronchial  epithelia  of  cancer-free  fonner  smokers  (14- 
18)  some  of  which  persisting  for  many  years  after  smoking  cessation  (15).  More  recently,  global 
mRNA  and  microRNA  (miRNA)  expression  profiles  have  been  described  in  the  normal¬ 
appearing  bronchial  epithelium  of  healthy  smokers  (19,  20)  that  are  different  from  those  in  non- 
smokers.  Moreover,  expression  and  pathway  signatures  have  been  derived  from  normal 
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bronchial  epithelium  of  smokers  that  exhibited  diagnostic  properties  (21,  22).  Molecular  changes 
involving  LOH  of  chromosomal  regions  3p  ( DDUT  and  FHIT  genes),  9p  ( CDKN2A ),  genomic 
instability  (increased  microsatellite  repeats)  and  pi  6  methylation  have  been  demonstrated  in 
histologically  normal  epithelium  in  squamous  cell  carcinoma  patients  and  in  the  sequence  of 
pathogenesis  of  the  disease  (14,  23,  24).  Moreover,  Nelson  et  al  demonstrated  that  KRAS  is  also 
mutated  in  histologically  normal  lung  tissue  adjacent  to  lung  tumors  (25).  Furthermore,  Tang  and 
colleagues  found  higher  rates  of  EGFR  mutations  in  adjacent  normal  bronchial  epithelia  (26,  27) 
suggestive  of  a  potential  localized  field  effect. 

It  is  plausible  to  assume  that  understanding  early  molecular  aberrations  in  histologically 
nonnal  smoke-damaged  airway  epithelium  of  early  stage  patients  would  serve  as  a  critical  first 
step  towards  identification  of  biomarkers  that  can  guide  lung  cancer  prevention  strategies. 
However,  the  global  molecular  airway  field  of  injury  in  early  stage  NSCLC  patients,  in  particular 
after  definitive  surgery,  is  unknown.  In  this  study,  we  used  transcript-level  expression  profiling 
coupled  with  gene-interaction  network  analysis  and  immunohistochemical  analysis  to 
characterize,  in-depth,  site-  and  time-dependent  global  molecular  alterations  in  airways  of 
smoker  early  stage  NSCLC  patients. 

Materials  and  Methods 

Patient  population  and  airway  epithelial  cell  collection.  Early  stage  (I/II),  current  or  former 
smoker  NSCLC  patients  with  at  least  a  10-pack-year  smoking  history  and  without  evidence  of 
disease  after  definitive  surgery  were  recruited  into  the  Vanguard  phase  II  surveillance  clinical 
trial  (clinical  trial  number  NCT00352391)  within  one  year  from  time  of  surgery.  Patients  were 
accrued  between  2004-2008  and  underwent  frequent  testing  including  chest  x-rays,  CT  scans, 
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laboratory  work,  serologies,  flexible  bronchoscopies  and  airway  biopsy  collections  within  one 
year  from  surgery  (average,  6  months;  range,  1-12)  (first  time  point),  and  at  months  12,  24,  and 
36  following  the  first  time  point.  Bronchoscopies  were  performed  using  white  light  or  both  white 
light  and  autofluorescence  modalities.  Biopsies  were  obtained  from  all  potential  anatomical 
locations  and  time  points  per  patient.  In  total  there  were  324  evaluable  airway  biopsies. 
Histological  assessment  was  performed  to  determine  whether  malignant  changes  will  occur 
during  the  time  period.  Once  patients  have  completed  3  years  of  testing,  they  were  followed  until 
the  study  is  completed.  Patients  were  comprised  of  former  (n=16)  and  current  (n=3)  smokers. 
One  of  the  three  current  smoker  patients  quit  smoking  6  months  before  the  24  month  time  point. 
The  clinicopathological  variables  of  patients  in  this  study  are  summarized  in  Table  1.  The  study 
was  approved  by  the  Institutional  Review  Boards  and  all  participants  provided  written  informed 
consents. 

Bronchial  airway  epithelial  cells  were  obtained  from  five  to  six  different  sites  (main 
carina,  4  airways  from  4  lobes  and  the  bronchial  stump  or  main  stem  bronchus  adjacent  to  the 
originally  resected  tumor  and  lobe,  Figure  1)  at  each  time  point  using  an  Olympus  fiberoptic 
bronchoscope  (Olympus  America  Inc.,  Center  Valley,  PA)  and  cytobrushes  (Cellebrity 
Endoscopic  Cytobrush,  Boston  Scientific,  Boston,  MA).  Patients  (n=  1 9)  with  samples/specimens 
available  for  analysis  that  were  obtained  serially  up  to  either  24  or  36  months  and  from  at  least 
from  four  different  sites  in  the  lung  at  each  time  point  (n=391  airway  samples)  were  selected  for 
the  study.  Epithelial  cell  content  was  confirmed  by  cytokeratin  staining  which  yielded  a  90% 
epithelial  cell  content  mainly  comprised  of  ciliated  cells.  Brushes  were  immediately  placed  in 
serum-free  RPMI  medium  on  ice,  vortexed  gently  to  disperse  epithelia  into  the  media  and  then 
removed.  Samples  were  then  immediately  centrifuged  and  cell  pellets  were  resuspended  in  1  ml 
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of  phosphate-buffered  saline  (PBS).  500  [xl  of  the  sample  was  then  again  centrifuged  and  the 
pellet  resuspended  in  500  (ll  P-mercaptoethanol-containing  RLT  buffer  (Qiagen,  Valencia,  CA), 
homogenized  and  stored  in  -80°C  until  further  processing.  Total  RNA  was  isolated  using  the 
RNeasy  Mini  Kit  according  to  the  manufacturer’s  instructions  (Qiagen). 

RNA  processing  for  microarrays.  Total  RNA  samples  were  preprocessed  for  subsequent 
hybridization  to  expression  arrays  using  the  WT-Ovation  and  Encore™  Biotin  Module  from 
NuGEN  Technologies  Inc.  (San  Carlos,  CA)  according  to  the  manufacturer’s  instructions. 
Briefly,  the  WT-Ovation  Pico  RNA  amplification  system  was  used  to  generate  amplified  cDNA 
using  5  ng  of  starting  RNA  material.  After  formation  of  double  stranded  cDNA,  DNA  was 
amplified  using  the  SPIA™  amplification  method,  a  linear  isothermal  DNA  amplification 
process  developed  by  the  vendor  (NuGEN  Technologies).  The  WT-Ovation™  Exon  Module 
(NuGEN  Technologies)  was  then  used  for  generation  of  amplified  sense  strand  cDNA  (ST- 
cDNA)  that  is  suitable  for  subsequent  array  analysis  with  the  Affymetrix  Human  Gene  1.0  ST 
array  platfonn  (Affymetrix,  Santa  Clara,  CA).  Fragmented  and  biotin-labeled  cDNA  was  then 
generated  using  the  Encore™  Biotin  Module  (NuGEN  Technologies)  using  5  |ig  of  amplified 
cDNA.  Quality  and  size  distribution  of  unfragmented  SPIA-amplified  cDNA  and  subsequent 
fragmented  labeled  cDNA  were  assessed  by  loading  samples  on  an  RNA  6000  Nano  LabChip 
(Agilent)  and  analysis  with  Agilent  bioanalyzer  2000  (Agilent).  No  differences  in  quality  were 
noted  based  on  duration  of  sample  storage. 

Generation  of  microarray  raw  data  and  analysis.  Fragmented  and  labeled  cDNA  (2.5  |lg) 
were  hybridized  onto  Human  Gene  1.0  ST  arrays  according  to  the  manufacturer’s  instructions 
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(Affymetrix).  Hybridization  cocktails  containing  samples,  control  oligonucleotide  and  eukaryotic 
hybridization  controls  in  addition  to  hybridization  mixes,  DMSO  and  nuclease-free  water  were 
heat  denatured  at  99°C  for  5  minutes,  cooled  to  45°C  for  5  minutes,  and  finally  centrifuged  at 
maximum  speed  for  1  minute.  After  injecting  80  |iL  of  the  hybridization  cocktails,  arrays  were 
incubated  for  17  hours  in  a  hybridization  oven  set  to  a  temperature  of  45°C  with  60  rpm 
rotation.  Arrays  were  washed,  stained  and  processed  using  Affymetrix  GeneChip  Fluidics 
Station  450  systems  after  which  they  were  imaged  using  Affymetrix  GeneChip  Scanner  3000  7G 
for  subsequent  generation  of  raw  data  (*CEL  files).  Raw  data  were  quantified  using  Robust 
Multichip  Array  (RMA)  background  correction,  quantile  normalization  and  RMA  probe-level 
models  (RmaPlm)  summarization  methods.  MIAME  compliant  metadata,  normalized  expression 
values  and  391  CEL  files  were  submitted  to  the  gene  expression  omnibus  (GEO)  (  samples 
GSM992943-GSM993345,  series  GSE40407).  After  data  preprocessing  and  normalization,  a  log 
base  2  transformation  was  applied.  Pathways  and  gene-interaction  network  analyses  were 
performed  using  the  commercially  available  software  Ingenuity  Pathways  analysis.  All  details  of 
the  microarray  analysis  including  pairwise  analysis  of  adjacent  and  contralateral  airways  for 
patient  clustering  are  included  in  the  Supplementary  Information  and  in  the  four  supplementary 
sweave  reports  accompanying  the  manuscript. 

Immunohistochemical  analysis  of  airway  biopsies.  Immunohistochemistry  was  done  on 
histological  sections  of  4  micron  formalin-fixed  paraffin-embedded  tissue  samples  prepared  by  a 
tissue  arrayer  as  described  previously  (28).  Immunohistochemistry  analysis  was  performed  using 
purified  rabbit  polyclonal  primary  antibodies  raised  against  phospho-AKT(Threonine308)  (1:200 
dilution,  clone  C31E5E,  catalog  number  2965)  and  phospho-ERKl/2(Thr202/Tyr204)  (1:400 
dilution,  clone  D13.14.4E,  catalog  number  4370)  (Cell  Signaling  Technology,  Danvers,  MA). 
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Antigen  retrieval  was  performed  using  the  Dako  target  retrieval  system  at  a  PH  of  6  (Dako, 
Carpinteria,  CA).  Intrinsic  peroxidase  activity  was  blocked  by  3%  methanol  and  hydrogen 
peroxide  for  1 5  min  and  serum- free  protein  block  (Dako)  was  used  for  7  min  for  blocking  non¬ 
specific  antibody  binding.  Slides  were  then  incubated  with  the  antibodies  against  phospho-AKT 
and  phospho-ERKl/2  at  room  temperature  for  90  and  120  min,  respectively.  After  three  washes 
in  Tris-buffered  saline,  slides  were  incubated  for  30  min  with  Dako  Envision+  Dual  Link  at  room 
temperature.  Following  three  additional  washes,  slides  were  incubated  with  Dako  chromogen 
substrate  for  5  min  and  were  counterstained  with  hemotoxylin  for  another  5  min.  Formalin- fixed 
and  paraffin-embedded  pellets  from  lung  cancer  cell  lines  displaying  positive  phospho-AKT  and 
phospho-ERKl/2  expression  by  Western  blot  analysis  were  used  as  a  positive  control,  whereas 
samples  and  whole-section  tissue  specimens  processed  similarly,  except  for  the  omission  of  the 
primary  antibodies  were  used  as  negative  controls.  The  intensity  and  extent  of  cytoplasmic  and 
nuclear  phospho-AKT  and  phospho-ERKl/2  immuno staining  was  evaluated  using  a  light 
microscope  (magnification,  x20)  independently  by  two  pathologists  (J.F.  and  I.I.W.)  who  were 
blinded  to  the  identity  of  the  samples.  Cytoplasmic  expression  was  quantified  using  a  four-value 
intensity  score  (0,  none;  1,  weak;  2,  moderate  and  3,  strong)  and  the  percentage  (0-100%)  of  the 
extent  of  reactivity).  A  final  cytoplasmic  expression  score  was  obtained  by  multiplying  the 
intensity  and  reactivity  extension  values  (range,  0-300).  Nuclear  expression  score  was  quantified 
using  the  percentage  of  extent  of  reactivity  (range,  0-100). 

Summary  statistics,  including  frequency  tabulation,  means,  standard  deviations,  median 
and  range  were  given  to  describe  subject  characteristics  and  immunohistochemical  protein 
expression.  Repeated  measures  analysis  was  performed  to  assess  the  differential  effect  of  sites  on 
phosphorylated  AKT  and  ERK1/2  expression  variation  with  time.  Mixed  effects  models  were 
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generated  to  assess  significance  of  site,  time  and  the  interaction  of  both  factors  to  the  expression 
of  both  proteins.  All  statistical  tests  were  two-sided  and  p-values  of  0.05  or  less  were  considered 
to  be  statistically  significant.  Statistical  analysis  was  perfonned  with  standard  statistical 
software,  including  SAS  Release  8.1  (SAS  Institute,  Cary,  NC)  and  S-Plus  2000  (Mathsoft  Inc., 
Seattle,  WA). 

Results 

Detailed  site-  and  time-dependent  airway  sampling  of  the  field  of  injury  in  early  stage 
NSCLC  patients  after  definitive  surgery 

Expression  patterns  molecularly  exemplifying  the  impact  of  smoking  on  the  airway 
epithelium  of  cancer- free  individuals  have  been  characterized  (8,  19,  29).  Moreover,  molecular 
abnormalities  typically  found  in  lung  tumors  have  been  detected  in  normal  resected  margins 
suggestive  of  a  field  effect  (15,  25-27,  30,  31).  However,  the  biological  nature  and  clinical 
relevance  of  the  field  of  injury  in  particular  after  complete  removal  of  the  tumor  in  early  stage 
NSCLC  patients,  who  are  increasing  in  number  and  for  whom  there  are  no  chemoprevention 
strategies,  is  yet  unknown.  Smoker  early  stage  NSCLC  patients  were  recruited  on  a  prospective 
phase  II  surveillance  clinical  trial  that  included  frequent  computed  tomography  (CT)  scans, 
serologies  and  annual  bronchoscopies  in  which  airway  brushings  and  biopsies  were  obtained 
within  one  year  following  tumor  resection  and  at  12,  24  and  36  months  following  the  first  time 
point  (Supplementary  methods  and  Ligure  1).  The  first  time  point  bronchoscopies  were  all 
performed  within  one  year  of  definitive  surgery  (average,  6  months;  range,  1-12).  Nineteen 
patients  were  selected  for  the  study  (Table  1)  based  on  airway  sampling  of  at  least  five  different 
sites  per  time  point  and  continuously  up  to  24  or  36  months  giving  rise  to  391  airway  samples  for 
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transcript-level  expressing  profiling.  The  patients  were  accrued  between  2004  and  2008  and  were 
comprised  of  former  (n=16)  and  current  (n=3)  smokers.  Brushings  and  biopsies  were  obtained 
from  the  main  carina  (MC),  airways  relatively  adjacent  (ADJ)  to  the  previously  resected  tumor 
(ADJ)  and  from  airways  more  distant  from  the  tumor  in  the  ipsilateral  (NON-ADJ)  and 
contralateral  (CONTRA)  lung  (Figure  1). 

Following  normalization  of  the  raw  expression  data,  a  mixed-effects  model  was  applied 
to  characterize  the  expression  pattern  of  each  gene  that  incorporated  fixed  effects  such  as  the  site 
from  the  tumor  and  time  after  surgery  of  the  collected  airway  samples  (Supplementary 
information).  Histogram  p-value  distribution  plots  after  fitting  beta-uniform-mixture  (BUM) 
models  (32)  for  derived  p-values  based  on  the  fixed  effects  suggested  that  both  site  and  time  of 
the  airway  samples  influenced  gene  expression  modulation  (Supplementary  Figures  1A  and  IB). 

Site-dependent  differential  expression  patterns  in  airway  epithelia  of  early  stage  NSCLC 
patients 

We  first  sought  to  determine  whether  gene  expression  profiles  are  differentially 
expressed  in  airways  by  site  from  the  tumor  including  those  relatively  adjacent  to  the  resected 
tumors  compared  to  more  distant  airways.  Based  on  the  generated  BUM  models  and  p-value 
distributions,  genes  differentially  expressed  by  site  were  selected  based  on  a  1%  false  discovery 
rate  (FDR).  We  identified  1165  gene  features  that  were  statistically  significantly  differentially 
expressed  by  site  (Supplementary  Table  1).  Two-dimensional  hierarchical  clustering 
demonstrated  that  the  airway  samples  were  divided  into  two  main  branches  or  clusters  (Figure 
2A)  based  on  expression  of  the  genes.  Moreover,  the  left  cluster  in  the  indicated  heat  map’s 
dendogram  contained  a  statistically  significantly  higher  number  of  adjacent  airway  samples 
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compared  to  the  right  branch  which  contained  a  significantly  higher  proportion  of  main  carinas 
and  contralateral  to  the  tumor  airways  (p=0.0027  of  the  Fisher’s  exact  test  for  count  data).  In 
addition,  the  two-dimensional  clustering  revealed  eight  different  gene  expression  patterns  which 
are  indicated  by  the  color  bar  and  code  along  the  left  side  of  the  heat  map  (Figure  2A).  Some  of 
the  different  gene  clusters  or  classes  were  associated  with  a  specific  group  of  airway  samples. 
Notably,  a  cluster  of  263  genes  (cluster  1,  Supplementary  Table  1),  indicated  by  the  dark  green 
color  and  asterix  on  the  heat  map,  was  found  to  have  highest  average  expression  in  adjacent 
airways  (Figure  2A  and  Supplementary  Table  2).  In  contrast,  another  cluster  of  348  genes 
(indicated  by  magenta  color)  appeared  to  be  highest  in  expression  in  main  carinas  (Figure  2A). 

We  then  determined  to  functionally  analyze  the  cluster  of  genes  (n=263),  that  was  found 
to  exhibit  the  highest  average  expression  in  adjacent  airways,  between  adjacent  and  contralateral 
airways.  Functional  pathways  analysis  using  Ingenuity  Pathways  Analysis  (IPA)  depicted  several 
significantly  modulated  pathways  and  molecular  functions  in  particular,  T-cell  (p=1.2  x  1  O'9), 
chemokine  C-C  motif  receptor  5  (CCR5)  (1.5  x  1  O'9)  and  phospholipase  C  signaling  (p=3.6  x  10' 
9).  In  addition,  topological  gene-interaction  network  analysis  highlighted  functionally  modulated 
and  up-regulated  gene  networks  mediated  by  phosphoinositide  3-kinase  (PI3K)  and  extracellular 
regulate  kinase  (ERK)  in  the  adjacent  airways  (Figure  2B).  It  is  worthwhile  to  note  that  we  also 
observed  increased  modulation  of  ERK  MAPK-mediated  gene-interaction  network  using  a 
different  analytical  method  where  we  compared  expression  profiles  between  adjacent  and 
contralateral  between  patients  in  a  pairwise  fashion  (data  not  shown,  Supplementary  sweave 
report  3).  These  findings  suggest  that  airway  site-dependent  differential  gene  expression  profiles 
in  early  stage  NSCLC  patients  exhibit  increased  molecular  features  and  gene-interaction 
networks  typically  associated  with  cancers. 
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We  then  determined  to  analyze  gene  expression  profiles  while  excluding  main  carinas 
since  their  epithelial  anatomy  is  suggested  to  be  different  from  that  of  other  airways  (33)  and 
thus  may  confound  site-dependent  observations.  Following  exclusion  of  main  carinas,  we  found 
a  reduced  number  of  genes  (n=136)  that  were  significantly  modulated  by  site  in  the  field  of 
injury  (Supplementary  sweave  report  4).  Two-dimensional  hierarchical  clustering  demonstrated 
that  airway  samples  were  divided  into  two  main  branches  or  clusters  (Supplementary  Figure  2) 
with  significantly  more  adjacent  airways  in  the  right  branch  (p=0.0002  of  the  Fisher’s  exact  t- 
test).  Moreover,  the  differentially  expressed  genes  (Supplementary  Table  4)  were  comprised  of 
two  main  subgroups  with  one  cluster  (top  cluster)  of  113  genes  exhibiting  highest  average 
expression  in  adjacent  airways.  It  is  important  to  note  that  when  we  cross-compared  gene 
clusters  that  we  had  found  to  exhibit  the  highest  average  expression  in  adjacent  airways  when 
including  (263  gene  cluster)  or  excluding  (113  gene  cluster)  main  carinas,  we  found  a  highly 
significant  overlap  (p=2.46  x  10'191)  in  the  number  of  genes  (n=96).  Moreover,  the  site- 
dependent  genes  identified  after  exclusion  of  main  carinas  were  also  topologically  organized 
following  functional  pathways  analysis  into  interaction  networks  involving  PI3K  and  ERK. 

Gene  expression  profiles  in  the  lung  airway  epithelia  of  early  stage  NSCLC  patients  are 
modulated  with  time  following  definitive  surgery 

We  then  determined  to  identify  genes  that  were  differentially  expressed  with  time.  Based 
on  the  generated  BUM  models  and  p-value  distributions,  time-dependent  differentially  expressed 
genes  were  identified  and  selected  based  on  a  5%  FDR  cut-off  (n=1395,  Supplementary  Table  4 
and  Supplementary  sweave  report  2).  Hierarchical  clustering  of  samples  indicated  that  the 
dendogram’s  main  branches  were  statistically  significantly  unbalanced  with  respect  to  time;  the 
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main  left  branch  comprised  a  higher  number  of  24  and  36  month  time  points  compared  to  the 
right  cluster  (p=4.15  x  10'7  of  the  Fisher’s  exact  test  for  count  data).  Two-dimensional  clustering 
of  both  genes  and  samples  revealed  two  main  classes  of  genes,  those  that  displayed  increased 
(upper  cluster)  and  decreased  (lower  cluster)  expression  with  time  (Figure  3A).  Functional 
pathways  and  gene-interaction  network  analysis  of  genes  differentially  expressed  genes  between 
36  months  and  the  first  time  point  revealed  statistically  significantly  modulated  pathways,  in 
particular  protein  ubiquitination  (5.3  x  1 0‘5),  glutathione  metabolism  (6.0  x  10‘5),  mitochondrial 
dysfunction  (1.4  x  10'4)  and  oxidative  phosphorylation  (2.9  x  10‘3)  as  well  as  eukaryotic 
initiation  factor  2  (eIF2)  signaling  (2.6  x  10'  ).  In  addition,  network  analysis  highlighted 
functionally  modulated  and  elevated  gene-interaction  networks  with  time  in  particular  those 
mediated  by  Nuclear  factor-kappa  B  (NF-kB),  ERK,  AKT  and  cyclin-B  1  (CCNB1)  (Figure  3B). 

We  then  sought  to  determine  the  relationship  of  genes  that  were  significantly  modulated 
by  site  and  time  in  the  molecular  field  of  injury.  A  smooth  scatter  plot  of  transformed  p-values 
indicated  that  site-  and  time-dependent  expression  modulations  were  largely  independent 
(Supplementary  Figure  3  and  Supplementary  sweave  report  2).  We  then  cross-compared  the  site- 
dependent  (n=1165)  and  time-dependent  (n=1395)  profiles  that  we  had  noted  in  the  molecular 
field  of  injury.  Using  hypergeometric  tests  for  overlapping  probability,  we  found  no  significant 
overlap  between  genes  we  had  determined  to  be  differentially  expressed  by  site  and  time  in  the 
molecular  field  of  injury  (p=0.865)  (Supplementary  sweave  report  4). 
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Increased  expression  of  phosphorylated  AKT  and  ERK1/2  in  airway  epithelia  by  site  from 
the  tumor  and  with  time  following  surgery 

Our  findings  on  the  modulation  of  PI3K-  and  ERKl/2-mediated  networks  by  site  and 
time  after  surgery  prompted  us  to  examine  the  immunohistochemical  (IHC)  expression  of 
surrogate  markers  of  both  signaling  cascades  in  corresponding  formalin- fixed  paraffin  embedded 
(FFPE)  airway  biopsies.  We  sought  to  examine  expression  of  AKT  phosphorylated  at 
Threonine308  since  phosphorylation  of  this  amino  acid  is  well  known  to  occur  through 
phosphoinositide-dependent  kinase  1  (PDPK1)  following  PI3K  activation  (34).  We  assessed  by 
IHC  the  immunohistochemical  expression  of  phospho-AKT(Thr308)  and  phospho- 
ERKl/2(Thr202/Tyr204)  in  available  and  evaluable  histologically  normal  bronchial  epithelia 
biopsies  (n=324)  corresponding  to  the  brushings  analyzed  by  expression  profiling. 
Immunoreactivity  of  phospho-AKT  (min  0,  max  300)  was  variable  as  depicted  in  the 
representative  photomicrographs  in  Figure  4 A  and  was  detected  in  both  the  cytoplasm  and 
nucleus  of  normal  bronchial  epithelia  (NBE)  (Figure  4A).  IHC  analysis  demonstrated  that 
cytoplasmic  (p<0.0001)  and  nuclear  (p=0.01)  phospho-AKT  statistically  significantly  increased 
with  time  up  to  three  years  in  all  NBE  (Figure  4B)  with  highest  expression  at  the  36  month  time 
point.  Nuclear  phosphorylated  AKT  was  also  statistically  significantly  increased  in  adjacent 
NBE  compared  to  relatively  airways  more  distant  from  tumors  in  the  mixed-effects  model 
(p=0.04)  (Figure  4B). 

Immunoreactivity  of  phospho-ERKl/2  was  also  variable  (min  0,  max  209)  and  mainly 
cytoplasmic  (Figure  4C).  IHC  analysis  demonstrated  that  phosphorylated  ERK1/2  was 
statistically  significantly  elevated  in  adjacent  NBE  (p=0.03)  and  significantly  increased  with  time 
up  to  three  years  in  all  airways  when  averaged  together  (p=0.02)  (Figure  4D)  in  the  model. 
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Notably,  there  was  a  significant  interaction  (p=0.019)  between  site  of  NBE  and  time  of  sampling 
as  phospho-ERKl/2  expression  was  significantly  increased  in  adjacent  NBE  but  not  in 
contralateral  airways  and  main  carinas  in  the  model  (Figure  4D)  with  highest  expression  in 
adjacent  airways  and  non-adjacent  (ipsilateral,  green  plot)  airways  in  observed  at  the  latest  time 
point.  Similar  data  were  obtained  when  we  excluded  main  carinas  in  the  mixed-effects  model 
(data  not  shown).  Moreover,  we  noted  similar  findings  when  all  airway  samples  were  analyzed 
irrespective  of  the  presence  of  preneoplastic  lesions  (e.g.  dysplasias). 

These  data  demonstrated  that  like  differential  gene  expression  profiles  within  the  lung 
airway  field  of  injury,  canonical  activated  oncogenes  are  modulated  by  site  from  the  resected 
tumor  and  time  following  definitive  surgery  in  early  stage  NSCLC  patients. 

Discussion 

In  this  report,  we  characterized  differential  expression  profiles  and  protein  expression 
within  the  lung  airway  field  of  injury  of  early  stage  NSCLC  patients  by  site  from  the  tumor  and 
time  in  years  following  surgery.  We  demonstrated,  and  to  our  knowledge  for  the  first  time,  that 
gene  expression  profiles  in  histologically  nonnal  airways  of  early  stage  NSCLC  patients  are  non- 
uniform  by  site  and  are  modulated  with  time  up  to  three  years  following  surgery.  Moreover, 
functional  analysis  of  the  expression  profiles  demonstrated  that  canonical  expression  patterns 
and  protein  kinase  activation,  typical  of  tumors,  are  increased  in  airway  sites  adjacent  to  tumors 
as  well  as  remain  or  are  further  modulated  in  the  lung  airway  field  of  injury  for  three  years  after 
definitive  surgery.  In  particular,  phosphorylated  ERK  MAPK  and  AKT  expression  were  up- 
regulated  in  normal  bronchial  epithelia  with  time  and  by  site  from  tumors.  In  light  of  the  growing 
sub-population  of  early  stage  NSCLC,  our  findings  are,  in  part,  proof  of  principle  and  raise  the 
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intriguing  possibility  of  the  importance  of  intense  surveillance  and  molecular  characterization  of 
the  remaining  smoking-injured  airway  epithelia  and  its  potential  integration  in  the  future  into 
clinical  practice  and  management  of  early  stage  disease.  However,  it  is  noteworthy  that  our 
study’s  patient  cohort,  despite  its  uniqueness  in  which  expression  profiling  was  performed  on 
airways  from  multiple  sites  collected  during  bronchoscopies  serially  performed  for  36  months 
following  surgery,  is  of  limited  size.  Moreover,  the  reported  findings  warrant  the  need  for 
validation  or  confirmation  in  independent  larger  sets. 

There  is  a  large  body  of  evidence  that  patients  who  have  survived  an  upper  aerodigestive 
cancer  comprise  a  high  risk  population  that  may  be  targeted  for  early  detection  and 
chemoprevention  efforts  (5,  31).  Currently  there  are  no  established  adjuvant  treatments  in  the 
tertiary  prevention  setting  of  early  stage  NSCLC  patients.  It  has  been  suggested  that  failures  in 
advances  of  chemoprevention  are  in  part  due  to  lack  of  clear  and  specific  molecular  targets  (5, 
35).  Our  extensive  profiling  of  the  molecular  field  of  injury  in  NSCLC  patients  identified  cancer- 
associated  pathways  (PI3K  and  ERK)  aberrantly  regulated  in  normal  bronchial  epithelia  of 
NSCLC  patients  after  tumor  removal.  In  this  context,  it  is  plausible  to  suggest  that  a  thorough 
characterization  of  the  molecular  field  of  injury  in  early  stage  NSCLC  patients  can  aid  in 
identification  of  aberrantly  expressed  pathways,  e.g.  PI3K  and  ERK,  which  could  potentially 
serve  as  suitable  targets  for  chemoprevention.  However,  it  cannot  be  neglected  that  the 
alternative  hypothesis  can  counter  argue  that  activation  of  PI3K  and  ERK  MAPK  pathways  may 
be  beneficial  for  chemoprevention  since  markers  of  both  pathways  (phosphorylated  AKT  and 
ERK)  increased  following  surgical  tumor  resection.  Our  suggestion  that  such  pathways  may 
serve  as  chemoprevention  targets  should  be  interpreted  cautiously  and  is  presented  owing  to  the 
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known  promalignant  function  of  the  pathways  and  gene  networks  highlighted  in  our  analysis 
(36). 

The  first  time  point  brushings  in  this  study  were  obtained  within  one  year  of  patient 
accrual  and  not  at  the  time  when  the  tumor  was  still  in  situ.  The  variable  starting  time  point  from 
surgery  among  patients  is  a  limiting  factor  in  our  analysis  since  it  is  plausible  to  assume  that  the 
molecular  field  of  injury  may  vary  upon  tumor  removal.  We  were  not  able  to  avoid  this  caveat 
given  the  difficulty  of  accrual  of  patients  following  surgical  tumor  resection  to  obtain  bronchial 
brushings  at  six  different  anatomical  sites  in  the  bronchial  tree  annually  up  to  three  years. 
However,  it  is  important  to  mention  that  the  time-dependent  gene  expression  profiles  we  had 
identified,  despite  not  incorporating  the  molecular  field  effect  at  the  time  the  tumor  was  present, 
demonstrated  gradual  changes  in  expression  with  time.  This  effect  was  also  evident  when 
analyzing  the  immunohistochemical  expression  of  phosphorylated  AKT  and  ERK  in 
corresponding  bronchial  biopsies  with  highest  expression  at  three  years.  Our  findings  warrant 
future  larger  studies  in  which  the  molecular  field  of  injury  at  the  time  the  tumor  is  still  in  situ  can 
also  be  serially  monitored  for  several  years. 

The  patient  population  we  had  studied  was  comprised  of  early  stage  NSCLC  patients, 
which  is  in  contrast  to  earlier  transcriptomic  studies  of  the  molecular  field  of  injury  that  focused 
on  phenotypically  normal  smokers  and  non-smokers  (19,  21,  22).  It  is  still  not  clear  whether  the 
differences  in  expression  described  in  this  study  reflect  an  already  present  gradient  field  of  injury 
that  may  have  contributed  to  tumor  development  in  light  of  the  differential  cancer-associated 
pathways  identified  or  one  that  arises  due  to  the  molecular  impact  of  the  tumor  on  the  adjacent 
field.  It  is  important  to  note  that  in  this  study,  the  spatial  and  temporal  molecular  field  of  injury 
in  lung  cancer  patients  was  profiled  prospectively  starting  within  one  year  following  definitive 
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surgery.  Thus,  the  above  speculation  may  be  addressed  by  a  similar  thorough  spatial  and 
temporal  characterization  of  the  molecular  field  of  injury  before  and  after  surgery  in  early  stage 
patients.  In  addition,  we  did  not  have  access  to  similar  type  of  brushings  from  cancer-free 
individuals  such  as  high-risk  smokers.  Similar  analysis  of  the  molecular  field  of  injury  in  cancer- 
free  individuals  will  shed  light  on  the  nature  of  site-  and  time-dependent  expression  patterns  in 
the  field  of  injury  and  whether  such  changes  in  cancer  patients  reflect  recovery  from  surgery 
(temporal  profiles)  or  are  a  cause  or  consequence  of  tumor  development  in  the  adjacent  field 
(spatial  profiles). 

In  conclusion,  our  unique  study  identified  gene  expression  profiles,  functional  gene- 
networks  and  activated  levels  of  oncogenic  protein  kinases  within  the  field  of  injury  of  early 
stage  NSCLC  patients  that  are  modulated  or  increased  in  airways  spatially  from  the  tumor  and 
temporally  following  surgery.  Moreover,  the  herein  previously  uncharacterized  airway  cancer- 
associated  expression  and  protein  kinase  alterations  harbor  potential  valuable  targets  for 
chemoprevention  and  warrant  confirmation  and  further  studies  in  larger  independent  cohorts. 
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Table  1.  Clinicopathological  features  of  NSCLC  patients  included  in  the  study 


Patient 

Histology 

Anatomical 

site 

Stage 

Recurrence 

Adjuvant 

chemotherapy 

Gender 

Age 

Smoking 

status 

Pack 

years 

Years 

quit 

smoking 

Months  to 
inclusion 
from 
surgery 

i 

ADC 

LUL 

IIA 

Yes 

No 

Male 

81 

Former 

100 

12.0 

i 

2 

ADC 

RLL 

IB 

No 

No 

Male 

64 

Former 

27 

19.1 

5 

3 

ADC 

RUL 

IB 

Yes 

Yes 

Male 

58 

Former 

12 

25.5 

7 

4 

SCC 

RLL 

IB 

Yes 

Yes 

Female 

62 

Former 

60 

7.6 

7 

5 

ADC 

RLL 

IA 

No 

No 

Female 

60 

Former 

31 

11.8 

6 

6 

ADC 

LLL 

IA 

Yes 

Yes 

Male 

62 

Former 

40 

5.8 

5 

7 

ADC 

RUL 

IB 

Yes 

Yes 

Male 

61 

Current 

70 

NA 

10 

8 

SCC 

LUL 

IB 

No 

No 

Male 

64 

Current 

100 

NA 

5 

9 

ADC 

RUL 

IA 

No 

No 

Male 

69 

Former 

12 

36.5 

8 

10 

ADC 

RUL 

IB 

No 

No 

Female 

63 

Former 

18 

8.6 

5 

11 

ADC 

RUL 

IB 

Yes 

No 

Female 

53 

Former 

42 

5.8 

3 

12 

SCC 

RUL 

IA 

No 

No 

Male 

62 

Former 

68 

0.4 

5 

13 

SCC 

LUL 

IA 

No 

No 

Male 

70 

Former 

96 

1.4 

7 

14 

ADC 

LUL 

IA 

Yes 

Yes 

Female 

45 

Former 

12.5 

0.9 

12 

15 

ADC 

RLL 

IA 

No 

No 

Female 

57 

Current 

80 

NA 

4 

16 

ADC 

LLL 

IA 

No 

No 

Male 

65 

Former 

48 

21.0 

7 

17 

ADC 

LLL 

IA 

No 

No 

Male 

64 

Former 

92 

5.1 

7 

18 

ADC 

RML 

IB 

No 

No 

Male 

71 

Former 

102 

2.3 

2 

19 

ADC 

LUL 

IA 

No 

No 

Female 

66 

Former 

50 

18.5 

5 

*  Anatomical  site:  location  of  primary  tumor  in  the  lung  (LUL,  left  upper  lobe;  LLL,  left  lower 
lobe;  RUL,  right  upper  lobe;  RML,  right  middle  lobe;  RLL,  right  lower  lobe 

*  Smoking  status:  smoking  status  at  time  of  inclusion  into  the  study  (patient  7  quit  smoking 
during  the  course  of  the  study). 

* Years  quit  smoking:  Years  from  smoking  cessation  to  time  of  inclusion  into  the  study. 

*  Months  to  inclusion  from  surgery:  Months  elapsed  from  surgery  to  time  of  inclusion  into  the 
study/first  bronchoscopy  time  point. 

ADC,  adenocarcinoma;  SCC,  squamous  cell  carcinoma 
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Figure  legends 

Figure  1.  Spatial  and  temporal  molecular  field  of  injury  in  early  stage  NSCLC  patients 
after  definitive  surgery.  Schematic  depicting  the  site-  (top)  and  time-  (bottom)  dependent 
collection  of  airway  epithelia  brushings  by  bronchoscopy.  Smoker  early-stage  NSCLC  patients 
were  enrolled  into  a  surveillance  clinical  trial  for  annual  follow-up  and  bronchoscopies  within 
one  year  after  definitive  surgery.  Bronchial  airway  epithelial  cells  (brushings  and  biopsies)  were 
obtained  from  live  to  six  different  sites  comprised  of  the  main  carina  (MC),  4  airways  from  4 
lobes  ipsilateral  (NON-ADJ)  or  contralateral  (CONTRA)  to  the  originally  resected  tumor  and  of 
the  bronchial  stump  or  main  stem  bronchus  adjacent  to  the  tumor  and  lobe.  All  site-different 
airway  epithelia  were  collected  at  the  time  of  inclusion  in  the  study  and  at  12,  24  and  36  months 
following  the  starting  time  point  (391  airways  from  19  NSCLC  patients). 

Figure  2.  Site-dependent  airway  epithelia  differential  gene  expression  patterns.  A.  Heat  map 
depicting  two-dimensional  clustering  of  airway  samples  (n=391)  and  genes  (n=1165)  that  were 
determined  to  be  differentially  expressed  by  site  in  the  mixed-effects  model  based  on  a  1%  FDR 
cut-off.  The  identified  eight  gene  clusters  are  labeled  with  different  colors  with  the  green  cluster 
of  genes  (n=263,  *A)  exhibiting  highest  average  expression  in  adjacent  airways  and  the  magenta 
cluster  (n=348,  *MC)  having  highest  expression  in  main  carinas.  B.  Gene-interaction  analysis  by 
IPA  depicting  networks  with  significant  scores  that  indicate  the  likelihood  of  genes  in  a  network 
being  found  together  than  due  to  chance.  The  depicted  networks  were  found  to  be  mediated  by 
P13K  (top)  and  ERK1/2  (bottom)  with  both  kinases  themselves  not  modulated  in  expression. 
Gene  expression  variation  based  on  the  statistical  cut-off  described  above  is  depicted  by  color  in 
the  network  (red,  up-regulated;  green,  down-regulated). 
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Figure  3.  Temporal  modulation  of  the  molecular  field  of  injury  after  definitive  surgery  in 
early  stage  NSCLC  patients.  A.  Heat  map  depicting  two-dimensional  clustering  of  airway 
samples  (n=391)  and  genes  (n=1395)  that  were  determined  to  be  differentially  expressed  by  time 
in  the  mixed-effects  model  based  on  a  5%  FDR  cut-off.  B.  Gene-interaction  analysis,  similar  to 
that  in  Figure  2,  by  IPA  depicting  networks  with  increased  likelihood  of  genes  being  found 
together  than  due  to  chance  and  mediated  by  NF-kB,  ERK1/2,  AKT  and  CCND1.  CCND1  itself 
was  up-regulated  at  the  expression  level.  Gene  expression  variation  based  on  the  statistical  cut¬ 
off  is  depicted  by  color  in  the  network  (red,  up-regulated;  green,  down-regulated). 

Figure  4.  Site-  and  time-dependent  immunohistochemical  expression  of  phosphorylated 
AKT  and  ERK1/2  in  the  airway  field  of  injury.  A.  Representative  photomicrographs  (20x 
magnification)  depicting  strong  (top)  and  weak  (bottom)  phospho-AKT(Thr308) 
immuno staining.  B.  Immunohistochemical  scores  of  cytoplasmic  (left)  and  nuclear  phospho- 
AKT  (middle)  were  assessed  for  statistically  significant  differences  by  site  and  time  in  a  mixed- 
effect  model  and  plotted  in  main  carinas  (MC)  and  in  adjacent  (ADJ),  non-adjacent  (ipsilateral, 
NON-ADJ)  and  contralateral  (CONTRA)  airways  with  time.  C.  Representative 
photomicrographs  (20x  magnification)  depicting  strong  (top)  and  weak  (bottom)  phospho- 
ERKl/2(Thr202/Tyr204)  immunostaining.  D.  Immunohistochemical  scores  of  phosphorylated 
ERK1/2  levels  were  assessed  for  statistically  significant  differences  by  site  and  time  in  a  mixed- 
effect  model  and  repeated  measures  analysis,  site*time,  term  for  interaction  between  site  and 
time  factors.  Error  bars  represent  S.E.  of  the  mean. 
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